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CHAPTER I 
INTRODUCTION 


1.1. Finite, infinite, and integral inequalities. It will be 
convenient to take some particular and typical inequality aa a 
text for the general remarks which occupy this chapter; and we 
select a remarkable theorem due to Cauchy and usually known 
as ‘ Cauchy’s inequality’. 

Cauchy’s inequality (Theorem 7) is 

(1.1.1) (oi6i + aii 62 + ...+a„6„)* 

^ + + ••• +®n*) (^1^ + 62® + ... +6„®) 

or 

(1.1.2) (Sa,6,)2^Sa,2S6,2, 

1 X 1 

and is true for all real values of a^, & 2 , •.•» ^>n• We 

call Oi, ... the variables of the inequality. Here the number 
of variables is finite, and the inequality states a relation between 
certain finite sums. We call such an inequality an eiemerUary or 
finite inequality. 

The most fundamental inequalities are finite, but we shall also 
be concerned with inequalities which are not finite and involve 
generalisations of the notion of a sum. The most important of such 
generalisations are the infinite sums 

(1.1.3) So,, o. 
and the integral 



(where o and 6 may be finite or infinite). The analogues of (1.1.2) 
corresponding to these generalisations are 

(1.1.6) (So,6,)*g^,®S6,* 

1 1 1 

HI 
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(or the similar formula in which both limits of summation are 
infinite), and 

(1.1.6) f{pc)g{x)di^ gj p{x)dxj g*{x)dx. 

We call (1.1.5) an infinite, and (1.1.6) an irUegrcd, inequality. 

1.2. Notations. We have often to distinguish b^ween dif- 
ferent sets of the variables. Thus in (1.1.2) we distinguish the 

two sets Oj , Oj o„ and 6i , 62 K- I* “ convenient to have 

a shorter notation for sets of variables, and often, instead of 
writing ‘the set a^, a^, ..., o„’ we shall write ‘the set (o)* or 
simply ‘the o’. 

We shall habitually drop suffixes and limits in summations, 
when there is no risk of ambiguity. Thus we shall write 

So 

n 00 00 

for any of So^, !Sa„, S a„; 

1 1 

so that, for example, 

(1.2.1) (So6)*^2o2S6» 

may mean either of (1.1.2) or (1.1.5), according to the context. 

In integral inequalities, the set is replaced by & function; thus 
in passing from (1.1.2) to (1.1.6), (o) and (6) are replaced by 
/ and g. We shall also often omit variables and limits in integrals, 
^ting 

for (1.1.4): so that (1.1.6), for example, will be written as 

( 1 . 2 . 2 ) {Ugdxf^iPdxlgHx. 

The ranges of the variables, whether in sums or integrals, are pre- 
scribed at the beginnings of chapters or sections, or may be 
inferred unambiguously from the context. 

1.3. Positive inequalities. We are interested primarily in 
‘ positive ’ inequalities^ A finite or infinite inequality is positive if 
all variables a, b, ... involved in it are real and non-negative. 
An inequality of this type usually carries with it, as a trivial 

• There are exceptions, as for example in §§ 8 . 8 - 8 . 17 . There the ^positiTe* oases of 
the theorems discussed are relatively trivial. 
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corollary, an apparently more general inequality valid for aU real, 
or even complex, a, 6, Thus from (1.1.2) and the inequality 

(1.3.1) 

valid for all real or complex u, we deduce 

(1.3.2) |Sa6|2^(Slall61)*^Slal2S161*, 

where the a and 6 are arbitrary complex numbers. We shall usually 
be content to state our theorems in the fundamental * positive’ 
form and to leave the derived results to the reader. Occasionally, 
however, when the inequality in question is very important, we 
state it explicitly in its most general form. 

Similar remarks apply to integral inequalities. The independent 
variable x will be real, but wiU (like the variable of summation v) 
take either positive or negative values; while the functions /(x), 
g{x\ ... will generally assume non-negative values only. To such 
an inequality as (1.1.6), true for non-negative /, corresponds 
the more general inequality 

(1.3.3) \ifgdx\^^S\f\^dxi\g\^dx, 

valid for arbitrary complex functions/, g of the real variable x. 

Numbers £, Z, r, s, ... occurring as indices in our theorems are 
real but in general capable of either sign. 

1.4. Homogeneous inequalities. The two sides of (1.1.2) 
are homogeneous fimctions of degree 2 of the a and also of the 6; 
and generally both sides of our inequalities will be homogeneous 
functions, of the same degree, of certain sets of variables. Since 
homogeneous functions of positive degree vanish when all their 
arguments vanish, both sides, if of positive degree, will vanish, 
and so be equal, when the sets concerned consist entirely of O’s. 
Thus (1.1.2) reduces to an equality if all the a, or all the 6, are 0. 

A set consisting entirely of O’s is called a null set, or the null set, 
if the context is unambiguous. In general the ‘ g ’ or * ^ ’ of our 
theorems will reduce to ‘ * when one or aU of the sets involved is 
null. Sometimes this will be the onZy case of equality. More usually 
there will be other cases; thus plainly * occurs in (1.1.2) if every 
a is equal to the corresponding 6. We shall be careful, wherever it 
is possible, to pick out explicitly such oases of equality. 



4 


INTEODTJCTION 


The homogeneity of an inequality in certain sets of variables 
often enables us to simplify our proofs by imposing an additional 
restriction (a rmmaliaation) on them. Thus the means ^(a) of 
1 2.2 aie homogeneous, of degree 0, in the weights p, and we may 
always suppose, if we please, that Sp = l. Again, if we wish to 
prove that { 

(1.4.1) (oi*+a2»+ ... +o„*)W»g {0i'’+02’' + ... 

when 0 < r < « (Theorem 19), we maysuppose (since both 
homogeneous in the a of degree 1) that Sa'’= 1. We havh then 
0/^1, V = (a/)»fr^o/, \ 

and so Ila* ^ Im’' — 1 . Without this preliminary normalisation, our 
proof would run 



There is another sense of ‘ homogeneity ’ which is sometimes 
important. Let us compare (1.4.1) above, which maybe written as 

(1.4.2) (Sa»)i/»g(SaO^, 

with (1.1.2). Both inequalities are homogeneous in the variables, 
but (1.1.2) has a further homogeneity which (1.4.2) has not. It is, 
as we may say, ‘ homogeneous in L ’ ; L, if treated as a number, 
would occur to the same power on the two sides of the inequality. 

The result of this homogeneity in S is that (1.1.2) remains 
true if every sum which occurs is replaced by the corresponding 
mean, i.e. if written in the form 

The importance of this kind of homogeneity will appear •very 
clearly in § 2.10 and §6.4. Roughly, ani inequality which possesses 
it has an integral analogue, while one which does not, like (1.4.2), 
has none. 

1.5. The axiomatic basis of aigebraic inequalities^. Our 
subject is difficult to define precisely, but belongs partly to 
‘algebra’ and partly to ‘analysis’. Algebra or analysis, like 
geometry, may be treated axiomatically. Instead of saying, as 
• See Artin and Schreier (1). 
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for example in Dedekind’s theory of real numbers, that we are 
concerned with such or such definite objects, we may say, as in 
projective geometry, that we are concerned with any system of 
objects which possesses certain properties specified in a set of 
axioms. We do not propose to consider the ‘axiomatics’ of 
different parts of the subject in detail, but it maybe worth while 
to insert a few remarks concerning the axiomatic basis of those 
theorems which, like (1.1.2) and most of the theorems of Ch. II, 
belong properly to algebra. 

We may take as the axioms of an algebra only the ordinary 
laws of addition and multiplication. All our theorems will then 
be true in many different fields, in real algebra, complex algebra, 
or the arithmetic of residues to any modulus. Or we may add 
axioms concerning the solubility of linear equations, axioms 
which secure the existence and uniqueness of difference and 
quotient. Our theorems will then be true in real or complex 
algebra or in arithmetic to a prime modulus. 

In our present subject we are concerned with relations of in- 
eqttality^ a notion peculiar to real algebra. We can secure an 
axiomatic basis for theorems of inequality by taking, in addition 
to the ‘indefinables* and axioms already referred to, one new 
indefinable and two new axioms. We take as indefinable the idea 
of a positive number, and as axioms the two propositions: 

I. Either a is or a is positive or —a is positive^ and these 
possibilities are exclusive. 

II. The sum and product of two positive numbers are positive. 

We say that a is negative if —a is positive, and that a is greater 
(fees) than 6 if a — 6 is positive (negative). Any inequality of a 
purely algebraic type, such as (1.1.2), may be made to rest on 
this foundation. 

1 .6. Comparable functions. We may say that the functions 
/(a)=/(0i,08, ?(a)=flr(ax,a8, — ,o«) 

are comparable if there is an inequality between them valid for 
all non-negative real a, that is to say if either/^ g for all such a or 
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g for all such a. Two given fonotions are not usually compar- 
able. Thus two positive homogeneous pol 3 momials of different 
degrees are certainly not comparable*; if O^f^g for all non- 
negative a, and both sides are homogeneous, then / and g are 
certainly of the same degree. i 

The definition may naturally be extended to functions 
f(a, 6, . . .) of several sets of variables. \ 

We shall be occupied throughout this volume with problems 
concerning the comparability of functions. Thus the Withmetio 
and geometric means of the a are comparable: ®\a)^§l(a) 
(Theorem 9). The functions ®(o+6) and &(a)+&(b) are com- 
parable (Theorem 10). The functions SI (ab) and SI (a) SI (b) are not 
comparable; their relative magnitude depends upon the relations 
of magnitude of the a and 6 (Theorem 43). The functions 

are comparable if and only if is convex or concave 
(Theorem 85). 

An important general theorem concerning the comparability of 
two functions of the form 

due to Muirhead, will be found in §2.18. 

1 .7 . Selection of proofs . The methods of proof which we use 
in different parts of the book will depend on very different sets of 
ideas, and we shall often, particularly in Ch. II, give a number of 
alternative proofs of the same theorem. It may be useful to call 
attention here to certain broad distinctions between the methods 
which we employ. 

In the first place, many of the proofs of Ch. II are ‘strictly 
elementary’, since they depend solely on the ideas and processes 
of finite algebra. We have made it a principle to give at any rate 
one such proof of any really important theorem whose character 
permits it. 

Next we have, even in Ch. II, many proofs which are not 
elementary in this sense because they involve considerations of 

* Compare { 2 . 19 . 
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limits and oontinmty. We have also, partionlarly in Ch. IV, 
proofs which depend upon the standard properties of differential 
coefficients, as for example upon Bolle’s Theorem. All these 
proofs belong to the elements of the theory of functions of one 
real variable. 

Later, when dealing with integrals in Ch. VI, we naturally 
mahe use of the theory of measure and of the integral of Lebesgue. 
This we take for granted, but we give a summary in §§ 6.1-6.3 of 
the parts of the theory which we require. 

Occasionally we appeal to the more remote parts of the theory 
of functions of real variables; but we do this only in alternative 
proofs or in the proo& of theorems of considerable intrinsio 
difficulty. Thus in Ch. IV (§ 4.6) we use the theory of the maxima 
and minima of functions of several variables; in Ch. VII we use 
the methods of the Calculus of Variations; and in Ch. IX we use 
the theory of double and repeated integration. We make no use 
of complex function theory, although, in the last chapters, we 
refer to it occasionally for purposes of illustration. The sections 
in which we do this do not belong properly to the main body of 
the book. 

We add a few further remarks of a more detailed character. 

(i) Cauchy’s inequality (1.1.2) is a proposition of finite algebra, 
as defined in § 1.5. It is a recognised principle that the proof of 
such a theorem should involve only the methods of the theory to 
which it belongs. 

(ii) We shall be continually meeting theorems, such as Holder’s 
inequality 

(1.7.1) So6g(Sa*)V*(S6*')V»' 

(Theorem 13), whose statusdepends upon the valueofaparameter 
k. If it is rational, the theorem is algebraical, and our remarks 
under (i) apply. If 1; is irrational, a*’ is not an algebraical function, 
and it is obvious that there can be no strictly algebraical proof. 

It is however reasonable to demand, when we are concerned 
with an inequalily so fundamental as Holder’s, that our step 
outside algebra shall be the absolute minimum which the nature 
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of the problem neoeesitates. It is plain that this step will depend 
upon our definition of a^. We may define a* as ezp (it log a), and 
in this ease it is obviously legitimate and necessary to use the 
theory of the exponential and logarithmic functions. If, as is 
more usual, we define as the limit of , where is an appro- 
priate rational approximation to k, then this limiting process 
should be the only one to which we appeal. \ 

(iii) Suppose that, adopting the last point of view^ we have 
proved Holder’s inequality, for rational h, in the for^ (1.7.1.). 
We can infer its truth for irrational & by a passage to the limit. 

Such a proof, however, is not usually sufficient for our purpose. 
We always wish to prove a theorem of a more precise type than 
(1.7.1),inwhich(a8 in Theorem 13) we establish strict inequality 
except in certain specified special cases. When we pass to the 
limit, ‘ < * becomes ‘ < ’, we lose touch with the cases of equality 
(though these are in fact the same as in the rational case), and 
our proof is incomplete. It is therefore necessary to arrange our 
proofs in such a manner as to avoid such passages to the limit 
wherever it is possible. The same point arises whenever we wish 
to pass from a finite inequality to the corresponding infinite or 
integral inequality. It recurs at intervals throughout the volume 
and has often determined our choice of a particular line of proof. 

(iv) The general principles which have governed our choice of 
methods are as follows. When a theorem is simple and funda- 
mental, like Theorems 7, 9, or 1 1 , we prove it by several different 
methods, and are careful that one of our methods at any rate 
shall conform to the canons laid down \mder (i) and (ii). When the 
theorem is subsidiary or difficult, or when a proof satisfying these 
conditions would be troublesome orlong, weusewhatevermethod 
seems to us simplest or most instructive. 

1 .8. Selection of subjects. The principleswhioh have guided 
us in our selection of evbjeda may be summarised as follows. 

(i) The first part of the book (Chs. II-VI*) contains a syste- 
matic treatment of a definite subject. Our object has been to 

• Except perhaiis some parts of Ch. IV, 
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discuss thoroughly (with their analogues and extensions) the 
simple inequalities which are ^in daily use’ in analysis. Of these 
three are fundamental, viz. 

(1) the theorem of the arithmetic and geometric means 
(Theorem 9), 

(2) Holder’s inequality (Theorem 11), 

(3) Minkowski’s inequality (Theorem 24); 

and these three theorems dominate the first six chapters. We 
prove them in a variety of ways, in the finite case in Ch. II, in 
the infinite case in Ch. V, and in the integral case in Ch. VI; while 
Ch. Ill (which contains a general account of the theory of convex 
functions) is mainly occupied with their generalisations. In these 
chapters, of which the most important are II, III, and VI, we 
have aimed at a comprehensive and in some ways exhaustive 
treatment. 

(ii) The rest of the book (Chs. VII-X) is written in a different 
spirit and must be judged by different standards. These chapters 
contain a series of essays on subjects suggested by the more 
systematic investigations which precede. In them there is very 
little attempt at system or completeness. They are intended as 
an introduction to certain fields of modem research, and we have 
allowed our personal interests to dominate our choice of topics. 

In spite of this (or because of it) the chapters have a certain 
unity. There is much modern work, in real or complex function- 
theory, in the theory of Fourier series, or in the general theory of 
orthogonal developments, in which the ‘Lebesgue classes 1/^" 
occupy the central position. This work demands a considerable 
mastery of the technique of inequalities; Holder’s and Min- 
kowski’s inequalities, and other more modem and more sophisti- 
cated inequalities of the same general character, are required at 
every turn. Our object has been to write such an introduction to 
this field of analysis as may be made to hang naturally on the 
subject matter of the early chapters. 

(iii) We are interested primarilyin certain parts of realanalysia, 
and not in arithmetic or in algebra for its own sake. The line 



10 


IKTBODtrOTIOK 


between algebra and analysis is often difficult to draw, especially 
in the theory of quadratic or bilinear forms, and we have often 
doubted what to include or reject. We have however excluded 
all developments whose main interest seemed to us to be alge- 
braical. 

We have also excluded function-theory proper, real or coml^lex. 
In the later chapters, however, we have sometimes tried to i^how 
the significance of otur theorems by sketching the lines of soi^e of 
their function-theoretic applications. 

Thus (to give definite examples) our programme excludes 

(1) inequalities of a definitely arithmetical character, such as 
those of the theory of primes, or those which give bounds for 
forms with integral variables; 

(2) inequalities which belong properly to the algebraical theory 
of quadratic forms; 

(3) inequalities, such as ‘Bessel’s inequality’, which belong to 
the theory of orthogonal series; 

(4) inequalities, such as ‘Hadamard’s three circle theorem’, 
which belong to function-theory proper: 

and there is no systematic discussion of geometrical inequalities, 
though we use them frequently for purposes of illustration. 

It may be useful to end this introduction by a few words of 
advice to readers who are anxious to avoid tmnecessary immer- 
sion in detail. The subject, attractive as it is, demands, for the 
writer at any rate, a great deal of attention to details of a rather 
tiresome kind. These details arise particularly in the exclusion of 
exceptional cases, the complete specification of cases of equality, 
and the conventional treatment of zero and infinite values. Such 
a reader as we have in mind may be content, in general, to sim- 
plify his task as follows. (1) He may ignore the distinction be- 
tween non-negcdive and positive, so that the numbers and func- 
tions witb which he is concerned are all positive in the narrow 
sense. (2) He may ignore our conventions concerning ‘infinite 
values*. (3) He may assume that the parameter 1; or r of in- 
equalities such as Holder’s and Minkowski’s is greater than 1. 
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(4) He may take it for granted that ‘what goes for snms goes, 
with the obvious modifications, for integrals’ (or vice versa). He 
should then be able to master what is essential without undue 
trouble. 

This advice for ‘easy reading’ must not be taken too literally. 
It is essential to understand the kind of exceptional oases which 
occur, and the general principles which govern the discrimination 
of cases of equality. It is not a mere academic exercise to pick 
out the cases of equality in such an inequality as Holder’s; a 
knowledge of these cases provides (as is shown very clearly in 
§§ 8.13-8.16) a powerful weapon for the discovery of deep and 
important theorems. Every reader should make it his business 
to explore this inequality at any rate to the end. 



CHAPTER n 


ELEMENTARY MEAN VALUES 

2.1. Ordinary means. In what follows we are concerned 
with sets of n non<n^ative numbers a (or 6, c, . . . ), say \ 

(2.1.1) Oj, 02 ) ■*■> ■**) \ 

and a real parameter r, which we suppose for the present not to 
be zero. 

We denote the ordered series (2.1.1) by (o). When we say 
that ‘(a) is proportional to (6)’ we mean that there are two 
numbers A and /x, not both zero, such that 

(2.1.2) Ao„=/i6„ (i'=l,2,..,,»). 

It will be observed that the null set, the set (o) in which every a is zero, 
is proportional to any (6). Proportionality, as we have defined it, is a 
symmetrical relation between sets but not a transitive one; it becomes 
transitive if we exclude the null set from consideration. 

If (a) and (6) are proportional, and neither of them is null, then 6,,s=0 
whenever 0 ^= 0, and ajh^ is independent of i> for the remaining values of v. 

We write 

( 1 /I " 

except when (i) r=xO or (ii) r < 0 and one or more of the a are 
zero. In the exceptional case (ii), when (2.1.3) has no meaning, 
we define SOlf as zero, so that 
(2.1.4) 2)l,.=0 (r<0, acme o zero)®. 

Here and elsewhere we shall omit the suffixes and limits of sum- 
mation when it can be done without ambiguity. 

In particular we write 

(2.1.6) l=3l(a)=S!Ri(o), 

(2.1.6) $=©(a)=2)l_i(o). 

Finally, we write 

(2.1.7) ®=®(a)=-C^(OiOj...oJ= .(^(na). 

^ * If we sdioitted infinite values, there would be a oonespondinff oue for poiitivefi 
TUB. r>0, some a infinite, xr © r > 
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Thus ^(o), ©(o), @(o) are the ordinary arithmetio, harmonic 
and geometric means. 

We have excluded the caee r^O, but we shall find later (§ 2.3) that we 
can interpret SWo conventionally as (5. We are not generally concerned 
with negative a, but it is sometimes convenient to use ^ (a) without any 
restriction of sign. The definition is unchanged. 

2.2. Weighted means. We shall however usually work with 
a more general system of mean values. We suppose that 


(2.2.1) 

p,>0 

(l^— 1, 2, •••, 72>) 

and write 



(2.2.2) 

9R,=aR,(o) 


(2.2.3) 

o 

il 

(r < 0, some a zero). 

(2.2.4) 

@}=@(a) = 

= @(a,p) = (no*’)W2». 


The equations (2.1.5) and (2.1.6) stand as before, with the ad- 
dition of the sjunbols ^(a,p), ^(a,p). The last remark of §2.1 
applies also to the generalised 31. The weighted means reduce to 
the ordinary means when p, = 1 for every v. 

The means being homogeneous and of degree 0 in the p’s, we 
may suppose, if we please, that Sp‘=l. In this case we shall 
replace p by g; thus 

(2.2.6) aJi,(a) = aJl,(a,g) = (Sgo'-)^ (Sg=l), 

(2.2.6) @(c) = @(o,g)= no« (Sg=l). 

We shall not usually refer to the weights explicitly in our 
formulae, but it is always to be imderstood that mean mines 
which are compared toith one another are formed with the same 
weights. 

Ordinary means are special cases of weighted means. On the 
other hand, weighted means with commensurable weights are 
special cases of ordinary means (with a different S 3 ^tem of a); for 
we may suppose, on account of homogeneity, that the weights 
are integral, and we can derive means with integral weights &om 
ordinary means by replacing every number by an appropriate 
set of equal numbers. Means with incommensurable weights may 
be regarded as limiting cases of ordinary means. 
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The following obvious fonnulae will be used repeatedly: 

(2.2.7) aJi,(a)={3l(an}'^^ 

(2.2.8) (U(a) = c*<‘°«“>. 


(2.2.9) 


i!K-r(«) = i 


1 


■9»,(l/a)’ 

(2.2.10) a)l,,(a) = {3R«(a0r. 

We suppose that o> 0 in (2.2.8), and in the other formul^ if a 
suffix is negative; the formulae may be extended to cover the 
missing cases by appropriate conventions. Also \ 

(2.2.11) ai(a+6) = 9l(a) + Sl(6), 

(2.2.12) (5J(a6) = ©(«)© (6), 

(2.2.13) 2R,(6) = *aR,(a) if(b) = k(a) 

(i.e. if = ka, , where k is independent of v) , 

(2.2.14) &(b) = k&(a) if (b) = k(a), 

(2.2.15) 2K, (o) ^ 9K,. (6) if ^ 6„ for all v. 


2.3. Limiting cases of 3)1, (a). We denote by 
Mina, Maxa 
the smallest and largest value of an a. 

1. Min a < 3)1, (a) < Max a, unlesa either all the a are equal, or 
else r < 0 and an, a is zero. 

It is to be understood here, and in the enunciations of all later 
theorems, that, when we assert that inequalities hold unless some 
particular condition is satisfied, we imply that at least one of the 
inequalities degenerates into an equality in the case excluded. 
Here, for example, Min a =311,(0)= Max a if all a are equal, and 
Mina= 3)l,(a) g Maxa in the other exceptional case. 

We form our means with q. Since 
Sg(a-2l) = 0, 

every a is equal to 81, or else a — 81 is positive for at least one a and 
negative for another. This proves the theorem for r= 1. 

In the general case we may suppose that either a> 0 or else 
r > 0, the oases excluded being trivial. It then follows that 

{3)l,(a)}r=8l(aO 
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lies between (Mina)’’ and (Maxa)’’, which proves the theorem 
generally. 

8. Mina < @ (a) < Max a, unless all the a are equal oranais zero. 
In the second exceptional case @ = 0. If ® > 0 then 



so that every a is ® or at least one is greater and one less than (Si. 

3. lim9Jl,(a)=:@(a). 

*•-►0 

If every a is positive 

an, (a) = exp ^hog Sga’-j 

= exp l^log (1 + rSjloga + 0(r®))| 

->exp (Hq log a) = na« = @ (o), 

when r->0. 

If there are some zero a, b denotes a positive a, and s is a $ 
corresponding to a 6, then 

an,(a,g) = (Sga’’)^= (Ss6’’)J^'= (S«)i(^a«,(6,«)->0 
when r->+0, since 3Jlj.(b,s)->®{b,s) and S3< 1. When r< 0,3)1, 
and ® are both zero, so that the result holds also when 0. 

Our proof depends on the theory of the exponential and 
logarithmic functions. We show in §2.16 how a more ele- 
mentary proof may be foimd if desired. 

4. lim 311,(0)=° Max a, lim 3)l,(a)=:Mina. 

r~»>oo 00 

If is the largest a, or one of the largest, and r > 0, we have 

211 ,( 0 ) 20 *; 

from which the first equation follows at once. The second is 
trivial if any a is zero and follows &om (2.2.9) otherwise. 

We now agree to write 

(2.3.1) 2)lo(o)=@(o), 311®(a)=Maxo, 311_oo(o)=Mina. 

With these conventions, we have 

6. 31l_oo (o) < 2)1, (a) < 3)loo (a) for all finite r, unless the a are all 
equal, or r 2 0 and an a is zero. 
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2.4. Cauchy’s Inequality. It is convenient to prove the next 
theorem here although it will be superseded later by a more 
complete theorem (Theorem 16). 

6. S)Zr(a)<31l2r(a) {r>0),uiile88 aUtheaareequdl, 

The inequality is 

and is a special case of the very important theoreni which 
follows. \ 

7. (Zab)^ < Sa* Zb\ unless (a) and (6) are proportional 

For Ea2E62-(Sa6)2=|S 

ft. V 

An alternative proof is as follows. The quadratic form 
2 {xa 4- yb)^ = Sa* -f 2xy Zab + y^ Zb^ 

is positive for all x, y, and therefore has a negative discrimi- 
nant, unless a;a^-f-y6„=0 for some x, y, not both zero, and all v. 
To deduce Theorem 6, take V? a*’\/p in place of a and 6. 

Theorem 7 may be generalised as follows : 

Za^ Zab ... Zal >0, 

Zla Zlb ... SI* 

unless the sets (a), (6), ..., (1) are linearly dependent, i.e, unless there are 
numbers x,y, not all zero, such that xa^ + yb^ + . . . + = 0/or every v. 

Either proof of Theorem 7 may be extended to prove Theorem 8: we 
may either express the determinant as a sum of squares of determinants, 
or we may consider the non-negative quadratic form 

Z(xa-\-yb + ..,i-wl)^ 

in.x,y, We do not go into details because any systematic discussion 

of inequalities connected with determinants and quadratic forms would 
carry us beyond the limits which we have imposed on the book. 

2.5. The theorem of the arithmetic and geometric 
means. We come now to the most famous theorem of the 
subject. 

» This is what is usually called Cauchy’s inequality: see Cauchy (1, 373). The 
corresponding inequality for integrals (Theorem 181) is usually called Schwarz’s 
inequality, though it seems to have been stated first by Buniakowsky: see Bunia- 
kowsky (1, 4), Schwarz { 2 , 261). 
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9. @ (a) < 3( (a), urdeas aUthea are equal. 

The inequality to be proved may be written in either of the 
forms 

(2.5.1) ... (a£t±:::±a*r 

\ Pl+ — +Pn } 

(2.6.2) ai«i02*«...a„««<Sgo 
(where as usual 1^9= 1). 

This theorem is so fundamental that we propose to give a 
number of proofs, of varying degrees of simplicity and generality. 
Of the two which we give in this section, the first is entirely 
elementary. The second depends on Theorem 3 and so, at present, 
on the theory of the exponential and logarithmic functions. We 
shall show later (§ 2.16) how this proof also may be made to 
conform more strictly to the canons of § 1.7. 

(i)» We have*> 

«. a, . . 

unless so 

with inequality in one place or the other unless 
Repeating the argument m times, we find 

(2.6.3) aia2...a2r?7<|^ j , 

unless all the a are equal. This is (2.5.1) with unit weights and 
n a power of 2. 

Suppose now that n is any number less than 2*”. Taking 


^n+1 ^n+2 ^ 2 ^ ~ 


^=2t, 


and applying (2.6.3) to the 6, we find 

.(“-5±-M)- =a-, 

or < 51^, 


to EuoUd(l;n6. v25). 


Cauchy (1, 375). 
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unless all the 6, and so all the a, are equal. This is (2.6.1) with 

unit weights. We deduce (2.6.1), with any oonunensurable 

weights, by the process explained in §2.2. 

When the weights are incommensurable, we can replace them 

by a set of commensurable approximations, prove (2.6.11) with 

the approximating weights, and proceed to the limit. In this 

process * < ’ is changed into ‘ ^ so that we do not at first abtain 

a complete proof of the theorem. We may complete the propf as 

follows. Write > . » / i o \ \ 

(k^I, 2, ... ,»), \ 

where > 0, g" > 0, and gj, is rational. Then 

r'=2g;, = 

are rational and f'+r" = 1. We have already proved (2.6.1) with 
‘ < ' for rational p, and with ‘ ^ ’ in any case. Hence 

(Sg"a\ 


no«'< 


/Sg'a'v*®' „ 

n<ir-na'n<^< (*, Sq’aJ (i Sq"aJ 

S 4- 2;g"a = Sga. 


Another way of completing the proof was shown us by B». E. A. C. Paley. 
This depends on Theorem 6. From this theorem, the formula (2. 2. 1 0), and 
what has been proved before, it follows that 

9t (o) = 9Jli (o) > aHj (a) = aril* (oi) ^ ©»(«*) = ® (a)- 

(ii)* By Theorems 6 and 3, we have 
?l(o)=aJli(o)>3Jl|(a)>?[)lj(a)>...> lim ?Dlla-m(a)=®(o). 

m->«> 

This proof is very concise but not quite so elementary as the 
first. It may be observed that we require Theorem 3 only in the 
case in which the r of Theorem 3 tends to zero through the special 
sequence of values 2~". 


8.6. Other proofs of the theorem of the means. We shall 
return to Theorem 9 in §§ 2.14-16 and again in § 2.21. We add 
here a few remarks about alternative proofs of the ordinary form 
of the theorem with unit weights. 

* Sohldmiloh (1). 
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(i)» If the a are not all equal, let 

ai=Mina<Maxa=aa. 

If we replace each of and by f (ai+a 2 )» (®) ^ unaltered, 

but 

i"V-) 

so that @ (a) is increased. 

Suppose now that we vary the a in such a manner that 21 is 
constant, and that we assume the existence of a set (a*) for which 
@ attains a maximum value. Then the a* must be equal, since 
if not we can replace them as above by another system for which 
® is greater. It follows that the maximum of ® is 21, and that this 
maximum is attained only for equal a. 

To prove the existence of (a*), let 

••• 

Then ^ is continuous in the closed domain 

• • • > ^n-l = ^n-1 = 

It therefore attains a maximum for some system of values 
ill the domain. 

* This proof, the most familiar of all proofs of the theorem, is due (so far as we have 
been able to trace it) to Maclaorin (2). Maclaiirin states the theorem in geometrical 
language, as follows: *If the Line AB is divided into any Number of Parts AO, CD, 
DE, EB, the Product of all those Parts multiplied into one another will be a Maxi- 
mum when the Parts are equal amongst themselves His proof is substantially that 
which follows. The proof has been rediscovered or reproduced by many later writers, 
for example by Grebe (1), Chrystal (1, 47). 

Cauchy’s proof (§2.5) may be regarded as a more sophisticated form of Mac- 
laurin’s, since ho proves the theorem in the special case when n—2”^ by a process 
similar to Maolaurin’s. In general, Maclaurin’s proof is not a 'unite* proof. As we 
have stated it, it depends on Weierstrass’s theorem on the maximum of a continuous 
function. This would naturally have been taken for granted by Maolaurin (and has 
also been taken for granted by many of his modem followers, such as Grebe and 
Chiystal). 

It is possible to avoid an appeal to Weierstrass’s theorem, but at considerable cost. 
It is plain that if a^; a,*; ... are the smallest and largest of the sets resulting 

from 1, 2, ... repetitions of Maolaurin’s process, then increases and decreases as 
« increases, so that 

fli* aj , Oj , aa S; aj . 

A little consideration will show that n repetitions of the process diminish the greatest 
difference of the a by at least one-half, sothatog" J(a, -Oj). Honceog* 

and »(Xg . It follows that all the a tend to the same hmit This gives a proof of 
the theorem, but one a good deal less simple than that in the text. 
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The reader should work out the analogous proof in which & is 
kept constant and and are each replaced by 


(ii) Tliere is a variation of Cauchy’s proof which illustrates a point of 
some logical importance* 

An ordinary inductive proof proceeds from n to n + 1; the trut)i of a 
proposition P(n) follows from the hypotheses 

(а) P(n) implies P(n-f 1), 

(б) P(n) is true forn= 1. 

There is another mode of proof which may be called proof by ‘backward 
induction*; the truth of P(n) follows from \ 

(o') P(n) implies P(n— 1), 

(6') P(n) is true for an infinity of n. 


Cauchy’s proof may be arranged as a proof of this last type. First, 
Cauchy proves (b') for n = 2*". Next, if the theorem is true for n, and if 31 
is the arithmetic mean of Oj , a, , . . . , o„_i , then an application of the theorem 
to the n numbers Oi , . . o„_i , 31 gives 




the result forn — 1 . 


(iii)*- Defining and Og as in (i), we may replace % and ag by 21 
and + a 2 — 21. Then 21 is again unchanged, and 

2l{ai4-a2-2l)-aia2 = (3l~ai) (a2~2l)>0, 
so that & is increased. Repeating the process we arrive, after at 
most w — 1 steps, at a system of a all equal to 21. It follows that 
®<2l. 

This proof is a little more sophisticated but entirely elementary. 
There is an alternative, which we leave to the reader, in which 
and Ug are replaced by ® and 


(iv) There are a number of inductive proofs of the theorem; see, for 
example, Chrystal (1, 46), Mmrhe&d (3). One of the simplest runs as 
follows**. Suppose that 0<Oi^o,^ ... ^a„, Oi<a„, that 3l„ and refer 
to the first v of the a, and that it has been proved that 3l„^i S; ® n»i • 
Then by Theorem 1, and 


m (n 1 ) 31,t_x _ or 1 ®n 



n 


» For these proofs see Sturm (1, 3), Crawford (1), Briggs and Bryan (1, 186), 
Muirhead (3). Hardy { 1 , 32). 

** Another simple proof due to R. Rado is given at the end of the chapter 
(Theorem 60). 
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Baitiiig this equation to the nth power, and remembering that n> 1 , we 
obtain 

~ So.<g”~? = < 6 ” • 

(v) Another interesting proof wae given very recently by Steffensen 

(1,2). It starts from the lemma: a,,, allv^ 

then 'La'Lh is not decreased by exchanging and , and ia increased except 
when Oi^biOra^^ Kfor v # i. The lemma follows at once from the identity 

{So + ( 6 < — a<)} {S 6 + (a< — 6 <)} 

= So D& -f" {bi “ a^) {(I^ b^) (Sa ■— o^)}* 

To deduce the theorem of the means, we write it in the form 

(Ol + Ui+**»+®l) {ctn + <^n+..*4-On) = (®l + ®t+*** +®n) ••• 

(®1 + ®t + • • • + Un)* 

If we suppose, as we may, that Oj ^ ^ . . . < o„ , and exchange n — 1 terms 

of the first factor of the left-hand side against one term of each of the other 
factors, we obtain 

(ai-haj+aa+.-. + aJ (Ui-f Oa + aa-|-...+aa)---(»i + '^n + On+-**+a*), 

which is greater, by the lemma, unless all the a are equal. The theorem 
follows by repetition of the argument. 

(vi) Further proofs of Theorem 9 (or of the special case considered in 
this section) are given in §§ 2.14, 2.16, 2.21, 3.11, and 4.2. 

2.7. Hdlder’sinequalityanditsextensions. Ournextgroup 
of theorems centres round Theorem 11 (Holder’s inequality)*. 

10. Suppose ffuU (a), (6), ..., (1) are m sets each of n numbers. 
Then 

(2.7.1) @(o) + @(6) + ... + @(l)<®(o+fc + ...+l), 

unless either (1) every two of (a), (6), ..., (Z) are proportional, or 

{2)thereisavsttchtheUay=b,=... = ly=0. 

The theorem states that, if = 1 , then 

•• • ... 6,^®. H* ...■!" ... 

< ( 0 ^+ 61 + ... + Zi)®i (o 2 + 6 j+ ... H-Z,)®* ..., 

unless every two columns of the array 

, b^, Zj 

® 8 > ^ 8 > •••> ^8 

* Striotly, ‘Hdider’s inequality’ is Theorem 14, or (2.8.3) of Theorem 13. The 

inequality (2.7.1) wa« stated explicitly, for two sets and equal weights, by Minkowaki 

(I. 117). 
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are proportional or there is a row containing only zeros. A neces- 
sary and suflSicient condition that all columns should be propor- 
tional (i.e. that every pair of columns should be proportional) is 
that = a^Cy--ayC^=0, for every fi and v; and 

this condition is also necessary and sufficient for the wopor- 
tionality of all rows. If we remember this, change our notawon as 
between rows and columns of the array, and write a, jS, ...i A for 
?i » ? 2 > • • •> ?n > Theorem 10 is equivalent to \ 

11. If oL, ...» Xare positive and 0 L + ^+... +X==l, then \ 

(2.7.2) (2a)« (Lb)P... (Ll)\ 

unless either (1) the sets (a), (6), ..., (Z) are all proportional» or 
(2) one set is null. 

The conditions for equality might also be expressed by saying 
that there is one set which is proportional to all the others (the null 
set being proportional to all other sets). The case in which one set 
is null is trivial, and we may ignore it in the proof. 

Here again we given two proofs. 

(i) By Theorem 7, 

unless (o) and (6) are proportional. Hence 

(Lahcdf ^ < Sa*26«Sc*Sd* 

with inequality somewhere unless (a), (6), (c), (d) are proportional®. 
Repeating the argument we see that 

(2.7.3) (Sa6 . . . If”' < . . . Si*", 

with 2™ sets (a), (6), ..., unless all the sets are proportional. This 
is equivalent to (2.7.2) when every index is 2“™. 

Suppose next that M is any number less than 2®, and let 

(g) be the ifth set. If (ab...g) is not null, we define A, B L 

by (Mset8), 

H^=K^”' = ... = U^ = ab...g {2”'-Msets), 


so that AB ... L=ah ...g, and apply (2.7.3) to A,B,...,L. We 


thus obtain 
or 


(Lab... g)^ < Sa" . . . Lg^ (Lab... gf”'-^ 


(2.7.4) (Lab ...g)^<La^Lb^ ...Lg^, 


* The null set being excluded, proportionality is now transitive: see §2.1, 
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unless the sets {A), {B), {L), and so the sets (a), (6), {g), 

are proportional. This is equivalent to (2.7.2) -with every index 
IjM. We have supposed {ab...g) not null; if it is null then (2.7.4) 
is obviously true, since none of (a), (6), .... (g) is null. 

If now «, P, ... are rational, we can write 


where a', jS', ... are integers and ha.' =M. Applying (2.7.2), with 
every index 1/Af, to M sets formed by a' like sets of a, j8' sets of 
b, and so on, we obtain (2.7.2) with indices a,)3, .... 

Finally, when a, jS, ... are not all rational, we replace them by 
rational approximations whose sum is 1, form (2.7.2) for these 
rational indices, and proceed to the limit. In this process ‘ < ’ 
degenerates into ‘ ^ ’ and, as in § 2.5 (i), we do not at first obtain 
a complete proof. We can complete the proof as follows. We 
can write a=ai+a 2 , P—Pi + P^, •••, where all the numbers are 
positive and those with suffix 1 are rational. If then Sai = ai, 
Sa2=<72, so that CTi + ctj^I, and 
we have = SPi“i Pg". S (SPi)"> (SPg)"*. 

Since cti, Pi, ... are rational 

hPi = < (So)®!'"! . . . (SZ)V"i; 

while for SP 2 we have a similar inequality, but with ‘ g ’ only. 
Combining our results we obtain (2.7.2). 

(ii) We may deduce Theorem 1 1 from Theorem 9. We have in 
fact (since no set is null) 
ha«‘bP...P 

(Sa)“(S6)^...(SZ)^ W w 


There can be equality only if 

^ h. Ll 

ha''''hb'' ''hi 


(v-1,2,. 


»). 


i.e. if (a), (6), ..., (Z) are proportional. 
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It will be observed that, whether a, /3, ... are rational or not, 
no limiting processes are involved in the proof beyond those 
already present in the proof of Theorem 9. The principle of the 
proof is the same as that of the proof of Theorem 13 below given 
independently by Francis and Littlewood® (1) and F. Bi^z (6). 

2.8. Hdlder’s inequality and its extensions {cmti^ued). 
If we suppose r + 0, and replace a, 6, . . . , 1 in Theorem 1 1 bj^ 
qWP, . . . , we obtain \ 

12. Ifr, a, p, X are positive arid «. + P + ... + X= 1, then 
9K,(o6 ... 1) < ...Wtrix(l) 

unless (6^^), ..., (l^^) are proportional or one of the factors on 
the right-hand side is zero. Ifr<0, the inequality is reversed. 

It is to be observed that, when r > 0, the second exceptional 
case ooovus only if one of the sets (o), (6), ... is null, whereas when 
r<0 it occurs if any number of any set is zero. When r = 0 
there is equality in any case. 

We shall often find it convenient, when we are concerned with 
two sets of numbers only, to use the notation 

(2.8.1) 

h being any real number except 1. The relation (2.8.1) may also 
be written in the symmetrical forms 

(2.8.2) (i-l)(fc'-l)=l, l + l 

(the last form failing when k = Q,k' = 0). We say that k and k' are 
conjugate. 

18. Suppose that ib 0, 1; 4= 1, and that k' is conjugate to k. Then 

(2.8.3) So6 < (So*)!^* (26*^)!'*' (i > 1) 
unless (o*) and {b^) are proportioned', and 

(2.8.4) Safe > (Ea**)!/* (26*')"** (* < 1) 
unless eiiher (o*) and (b^) are proportional or (ab) is null. 


See Hardy (8). 
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Cauchy’s inequality (Theorem 7) is the special case i = Jfc' = 2, 
in which h is conjugate to itself. 

(i) Suppose that *>1. Then (2,8.3) is the special case of 
Theorem 11 in which there are two sets of letters and a= 1/1:, 

1/1:'. This is the ordinary form of Holder’s inequalitsr*. 

(ii) Suppose that 0 < 1: < 1, so that 1:' < 0. If any 6 is 0 then 
the second factor on the right-hand side of (2.8.4) is, as in 
§2,1, to be interpreted as 0, so that (2.8.4) is true unless (ab) 
is null. If every 6 is positive, we define I, u, v by 

1 = 1 / 1 :, 

so that 1>1, k'=—kl’ 

and u=[ab)^, v=b~^, 

so that ah = u*, a’‘=uv, = 

Then (2.8.4) reduces to (2.8.3) with u, v, I in place of a, b, k. 
The exceptional case is that in which (u') and (v' ), i.e. (ab) and 
(6*''), are proportional. If this is so then (since the b are now all 
positive) the sets (a) and (6*^~^), smd therefore the sets (a**) and 
(b’^), are proportional. 

(iii) If 1:<0, then 0 <!:'<!. This case is reduced to (ii) by 
exchanging a and 6, k and k'. Both (ii) and (iii) are included 
in (2.8.4). 

The inequalities remain true in the excluded cases k=0,k=l 
if we adopt appropriate conventions. If 1:=0, 1:' = 0, we must 
interpret (2.8.4) as 

^ U'(Uj ... O/^b^ ... 6^)^*, 

If 1:= 1 we may interpret k' as H-oo or as — oo. In the first case 
we interpret (2.8.3) as ]Sa&<Max&Za, and in the second we 
interpret (2.8.4) as So6 > Min b So, We may leave it to the reader 
to pick out the cases of equality. 

We can combine (2,8.3) and (2.8.4) in the single inequality 

(2.8.5) (So6)**' < (So*)*' (Sft*’')* (&+ 0, 1:+ 1). 

In view of the extreme importance of Holder’s inequality, we 

* H6lder (1). Holder states tbe theorem in a leas qmmetrioal form ghrsn a little 
esflier by Rogers (1). 
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depart from our usual practice here and state explicitly the 
derivative theorem for complex a, 6. 

14. // fc > 1, and K is the conjugaie of k, then 

There is equality if and only if {\ap\^) and ( 1 6^ 1*^) propclHional 

and arg a^b^ is independent of v. \ 

The only additional remark needed for the proof is that 
lSa6|<S|a6| 

unless arga^fc^ is independent of v. We regard 0 as having any 
argument we please. 

The following variant of the first part of Theorem 13 is some- 
times called ‘the converse of Holder’s inequality’. 

16. Suppose that k>l, that ¥ is conjugate to h, and that B>0. 
Then a necessary and sufficient condition that ha^^A is that 
Sa6 ^ for all bfor which ^ B. 

The condition is necessary, by (2.8.3). If 'La^>Ay we can 
choose the b so that JS and (6^') is proportional to (a*), and 

Eo6 = (So*)y* JSV*'. 

Hence the condition is also sufficient. 

Theorem 15 is often useful for the purpose of determining an 
upper bound for Sa*. Any argument based on it can be changed 
into one which involves only a special (6), but the form stated 
here, with arbitrary (6), is sometimes more convenient*. 

2.9. General properties of the means We can now 
prove a theorem which completes and st^rsedes some of those 
of §§2.3-4. 

IB.** IfrKsihea 

(2.9.1) 2R,(o)<9Ji,(o), 

unless the a are aU equal, or s^O and an a is zero. 

We have proved this already in the special oases (i) r=s — oo 

• Compare $$6.9 (p. 142) and 6.13 (p. 149). 

Schl&milch (1). See aliso Eeynaud and Buhamel (1, 156) and Cbrystal (1, 48). 
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(Theorem 5), (ii) « = + oo (Theorems), (iii) r = 0, « = 1 (Theorem 9), 
(iv) «= 2r (Theorem 6). 

Suppose first that 0<r<a, and write r=8a, so that 0<a< 1, 

and , 

pa*—u, p = v, 

so that 0 > 0 and pa»“ = (po»)“pi-“ = 

Then 

(2.9.2) < (Stt)“ (2?;)i-“ 

by Theorem 11 , unless is independent of v, i.e. unless is 
independent of v. Hence 

\ tp ) <1-^/ * 

which is (2.9.1). 

The cases in which r ^ 0 and an a is zero are trivial and we may 
ignore them. If every a is positive, and r = 0 < «, we have 

(9Ro («))* = (o))* = ® (a*) < SI (a*) = (9». (a))*, 
by Theorem 9 and (2.2.7). The two remaining cases, r < a < 0 and 
r < a = 0, reduce to those already discussed in virtue of (2.2.9). 


17.» IfO<r<a<tthen 
( 2 . 9 . 3 ) 

uideas aXlthea which are not zero are equal. 

We restrict the parameters to be positive, the complications 

introduced by negative or zero values being hardly worth 

pursuing systematically. 

We may write ^ 

^ a=ra+f(l — a) (0<a<l). 


The inequality is then 

Sgo* < (Sg-aO" (SgoO^-“ 

and reduces to a case of Theorem 11 when we write 
voiqa*. The condition for equality is that {u) and (v) should be 
proportional, and this is plainly equivalent to that stated in the 
enunciation. The reader should observe the difference between 
the conditions for equality in Theorems 16 and 17. 


Liftpounoff (1, 2)« 
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We shall see later (§3.6, Theorem 87) that Theorem 17 may 
be stated in a more striking form. 

2.10. The sums (i) We write 

©, = (S,(a) = (2o0^^ (r>0). 

We confine our attention to positive r, leaving the oonstmotion 
of a theory of for r ^ 0 as an exercise to the reader. 

18. If0<r<8 <tthen 

( 2 . 10 . 1 ) 

unless all the a which are not zero are equal. 

This is essentially the same theorem as Theorem 17. In fact 

(2.10.2) @,(a)=n«^2R,(o), 

the mean Wtr(a) being formed with unit weights, and (2.10.1) 
reduces to (2.9.3), the powers of n disappearing. 

The correspondence between Theorems 17 and 18 depends 
essentially on the fact that (2.9.3) and (2.10.1) are homogeneous 
in the second sense of § 1.4, namely in the sign S. There is a 
theorem for sums corresponding to Theorem 16, but in this 
theorem, which is expressed by (2.10.3) below, the sign of 
inequality is reversed; (2.10.3) is not homogeneous in 2, and 
is not related to (2.9.1) as (2.10.1) is related to (2.9.3). 

19. ® If 0<r<s then 

(2.10.3) ©,(«)<©,(»), 
unless all the a but one are zero. 

Since the inequality is homogeneous in the a, we may suppose 
So'‘= 1, i.e. ©,= l.*> Then o,,< 1 for every v, and so af-^aj and 

If more than one a is positive then at least one positive a is less 
than 1, and then there is inequality. Theorem 19 is usually 
quoted as Jmisen’s inequality. 

(ii) We add the theorems for corresponding to Theorems 

4 and 3. 

* Pringsheim (1), Jensea (2). Pringsheim attributes his second proof to LUroth. 

^ Compare the remarks on this proof in 
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20. @,-»-Maxa whm r-*-co. 

21. when r-*-0, unless dU the a bid one are zero. 

Theorem 20 follows from (2.10.2) and Theorem 4. To prove 

Theorem 21 we have only to observe that So’’® JV +o(l), where 
N is the number of positive a, 

(iii) Theorem 19, combined with Theorem 1 1 , gives the follow- 
ing theorem of Jensen*. 

22. If «, jS, ..., A are ‘positive and a-l-j8+... + A> 1, then 

< (Sa)“ (26)^ . . . (Sl)\ 

unless every number of one set or all but one of each set is zero, and, 
in the latter case, those which are positive have the same rank, 
Wecanwritea=l:a', j3=i:)S', ..., where !:> landa'-l-/S'-|- ... = 1. 
Ifthena^^.d, ...,wehave 

Sa“6^... ... ... (LLf 

= (Safc)“* . . . (SZ*)^* < {Sa)“ . . . (SZ)^, 
by Theorems 11 and 19. There is inequality somewhere unless 
the conditions for equality in both theorems are satisfied. 

(iv) It is natural to consider weighted sums 

a; = 2,(0) = a;{o,p) = (Spa--)^'' . 

It is plain that there can be no universal relation of the type (2.9.1) or 
(2.10.3), since %r is the Qr of Theorem 19 when p„=: 1 and is 9[Rr when 
Sp^=l. The possibilities in this direction are settled by the following 
theorem. 

23. A necessary and sufficient condition that 

(2.10.4) (0<r<tf), 

for given weights p and all a, is that Lp g 1. There is then inequality unless 
(a) is null, or Sp = 1 and all the a are equal* 

A necessary and sufficient condition that 
(2.10.6) {0<r<s), 

for given weights p and all a, is thatp^^ I for every v. There is then inequality 
unless (a) is null, or a* > 0, p* = 1, and the remaining a are zero. 

(i) If we take a^= 1 for every v, then %r^(S,p)^^^, and (2.10.4) can be 
true only if Up ^ 1 . If this condition is satisfied, and r = sa, so that 0 < a < 1 , 
we have Spa*^ = S (pa*)« p'~« S (Spa»)“ (Sp)i-“ ^ {^pa*)^, 

which is (2.10.4). The conditions for equality are plainly as stated. 

• Jensen (2). 
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(ii) If we take a*= 1 and the other a zero, and (2.10.6) can 

be true only if ^ 1. If we assume that this condition is satisfied, write 
so that j3> 1, and assume, a<s we may on grounds of homogeneity, 
that 1, then^a*’^ 1 for every v and 

Spa* = S g S {pa!^)^ g Spa*', ^ 

which is (2.10.5). The conditions for equality are again plainly a^ stated. 

2.11. Minkowski’s inequality. Our next theorem is V gene- 
ralisation of Theorem 10. \ 

24. Suppose that r is finite and not equal to 1. Then 

( 2 . 11 . 1 ) 

aR,(a) + 2R,(6) + ... + 9Jl,(?)>2K,(a^ (r>l), 

( 2 . 11 . 2 ) 

SDl,i(u) + 3!Ky (6) -f* ... 4"9!Jiy(Z) <91i,.(tt + 64- ... 4*^) (/*< 1), 

unless (a), (6), (Z) are proportionaly or r^O and 

a^ = 6^=... = Z^ = 0 

for some v. 

There is equality for any a, by ... when r= 1. Theorem 10 is 
the special case r = 0. The main result remains true (and is 
trivial) when r=oo or r = — cx), except that the conditions for 
equality require a restatement which may be left to the reader. 
We take the means with g, and write 

a4-6-f ... + Z=5, SW^(5) = 5^. 

Then = Sga’’ = -f- 4- ... 4- SgZ^’’”^ 

= S {q^^a) {q^sy-^ + ... + 2 (q^H) (gVr^)r-i, 

Suppose first that r> 1. Appl 3 dng (2.8.3) of Theorem 13 to 
each sum on the right, we obtain 

(2.11.3) 8^ ^ (Zqa^)^ (I,q8^)^ 4- ... = ((Sga0^*^4- ...). 

There is equality only if {qa% (g6**), ... are all proportional to 
(q^)y i.e. if (a), (6), . . . are proportional. Since 8 is positive (except 
in the trivial case when every set is null), this establishes (2.1 1. 1)». 

Suppose next that 0<r< 1. Unless all the sets (a), (6), ... are 
null, ^,,>0 for some v. K 5^=0 for any particular v, then 


ft This proof is due to F. Biesz (1, 45). 
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o„=6y=...=Zy=0, and we may omit thatTalue of v from con- 
sideration. We may therefore argue as if 0 for every v. In 
that case (2.8.4) of Theorem 13 gives (2.11.3) with the sign of 
inequality reversed, and the proof may be completed as before. 

Finally, suppose that r < 0. If any is zero, all the means are 
zero; we may therefore assume that «„ > 0 for every v. If any o„ is 
zero, 9)1, (o) = 0, and we may omit the letter afi We may therefore 
argue on the assumption that every a, 6, ... is positive, and then 
again everjiihing follows from (2.8.4) of Theorem 13. 

When the q are equal, we obtain 

25. Ifris finite and not equal to 0 or I, then 

(2.11.4) 

(2 (o + 6 + . . . + Vf)^ < (So')!^ + . . . + (S?)^ (r > 1). 

(2.11.5) 

(2 (o + 6 + . . . + if)^ > (20'-)^ +...+ ^r<i), 
unless (a), (6), ..., (1) are proportional, or r < 0 and a^, 6,, ..., l„are 
all zero for some v. 

It is (2.11.4) which is usually called Minkowski’s iaequality**. 
Theorem 24 is more general than Theorem 25 in appearance 
only, since it may be deduced from Theorem 25 by writing 
p^a, p^b, ... for a, b, 

Theorem 24 may be given a very elegant symmetrical form®. 

26. Suppose that 9Jl^> denotes a mean taken tvith respect to the 
suffix p, with weights p^, and 9)1^’’) one taken with respect to v voiOh 
weights and that 0 < r < a < cx). Then 

aK.<‘’)9Jl,(/*>(c^J < 9)i,(#‘>9RW(v). 

except when a^y =b^Cy. 

The result Judds generally for aU r, s such that r<s, except for the 
specification of the cases of equality. 

* Here we use (2.2.15). ^ Minkowski (1, 115-117). 

^ Theorem 26 was communicated to ns in 1929 by Mr A. E. Ingham. The same 
formulation of Minkowski’s inequality was found independently by Jessen and 
published in his paper 1. This and his later papers 2 and 3 contain many interesting 
generalisations: see Theorems 136 and 137. 

a 'We depart here from our usual conyention about qi Hq is not necessarily 1 
(though we prove the inequality by transforming it into one in which we may sup* 
pose Sgsl)* 
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We prove the theorem for 0<r<s<oo, leaving the other oases 
to the reader. There are various supplementary cases of equality 
when r g 0 or one of r and 3 is infinite. 

Let 8lr^k>l and Then the inequality to be 

proved is \ 



This reduces to (2.11.1) when 1, and, being homogeneous 
in the g, is true without this restriction. 


2.12. A companion to Minkowski’s inequality. The 
theorem which follows is an analogue of Theorem 25 of a simpler 
kind, 

27. If T is positive and not equal to 1 then 

(2.12.1) 2 (ft 4* ft + . . . + i)** > Stt** -f- + . . . + Sif^ (r>l), 

(2.12.2) S(ft4-ft + ...+i)’’<Sft’' + Sft»'+... + LJ** (0<r<l), 

unless oil numbers but one of each set a^, by, ..., ly (v= 1, 2, ..., n) 
are zero. 

This follows at once from Theorem 19, since for example 
(ft-f’ft-f’***'!’ Vf > ft** 4* ft** 4" ... 4” Z** 

ifr> 1, unless all of ft, ft, ..., I but one are zero. It should be noticed 
that the sense of (2.12.1) and (2.12.2) is opposite to that of 
(2.11.4) and (2.11.5). 

What is usually required in practice is a combination of (2. 11.4) 
and (2.12.2), viz. 

28. // r > 0 then 

(2 (ft 4 ft + ... 4- lY)^ ^ (2ft**)^ 4 (260^ 4 ... 4 (2Z**)«, 
where 1 i/0<rg 1 and jB= 1/r i/r> 1. 


2.13. Illustrations and applications of the fundamental 
inequalities, (i) Oeometrical interpretations of Holder's and Min- 
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kowahVa ineq'mlitiea. Two particularly simple cases of Holder’s 
and Minkowski’s inequalities are 

(2. 13. 1 ) + ^1^2 + 212 : 2)2 < (Xt^ -f 4- 2i2) {x^^ + ^ 2 ^ + 

( 2 . 13.2) V{(^i + ^ 2 )^ + ( 2/1 + y^Y + ( 2 i + z^Y) 

< + Vi + ^ 1 ) 4- \/( V 4- 2 / 2 ^ 4- 

These hold for all real values of the variables, and express the 
facts that (1) the cosine of a real angle is numerically less than 1, 
and (2) the sum of two sides of a triangle is greater than the third 
side. The exceptional cases are those in which (1) the vectors 
2 i) and (x^, z^) are parallel (with the same or opposite 

senses), and (2) the vectors are parallel and have the same sense. 

The ordinary form of Minkowski’s inequality is the extension 
of (2.13.2) to space of n dimensions with a generalised definition 
of distance, viz. 

P i-P2~ ( I 1^4" I j/i — j/ 2 1*^4- ...)^*’ (^^ !)• 

The most obvious extensions of (2.13.1) are connected not with 
Holder’s inequality for general r but with a generalisation of the 
case r = 2 in a different direction. 

29. If Ita^^x^x^, where is a positive quadratic form 

(with real, but not necessarily positive, coefficients), then 

unless (x) and (y) are proportional. 

This is an immediate consequence of the fact that 

(Ax^ + ny^) (A*^ + fiy,) 

is positive: compare the second proof of Theorem 7. It represents 
geometrically an extension of (2.13.1) to Ti-dimensional space, 
with oblique coordinates or a non-Euclidean metric. 

To illustrate Theorem 15 , take h=2, A = P, J5=l, and 
rectangular coordinates. The theorem then asserts that, if 
the length of the projection of a vector along an arbitrary 
direction does not exceed I, the length of the vector does not 
exceed 1. 
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(ii) A theorem of Hadarmrd^. In our next theorem also we a>re 
concerned with a set of numbers real but not necessarily 
positive. 


30. If Die the determinant whose constituents are 


then 

(2.13.3) 

There is equality only when 

(2.13.4) **• ® 

for every distinct 'pair (i, v, or when one of the factors on the right- 
hand side of (2.13.3) vanishes. 

The geometrical significance of the theorem is that the volume 
of a parallelepiped in n-space does not exceed the product of the 
edges diverging from one comer, and that there is equality only 
when they are orthogonal or an edge vanishes. 

Suppose that where is a positive quadratic 

form, and that A is the determinant whose constituents are . 
Then the equation 

(2.13.6) Cu-A 1,12 =0 

<,21 ^ 


has n positive roots'’ whose sum is and whose product is A. 
Hence, by Theorem 9, 

(2.13.6) ^g^ Cii+C23±- . l^»>» j”. 

If > 0 for all ft, then the form 

^ viC^)*'**"*^^'*’'*'**” 

is also positive; and if we apply (2.13.6) to this form, we obtain 

(2.13.7) A^CllC22...C,jn* 

* Hadsmard (1) consideni determinants with oomplez constituents. Theorem 
30 was found earlier by Kelvin and proved by Muir (1). 

*» See Bdoher (1, 171). 
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This is substantially equivalent to Hadamard’s theorem. For 
the form 

^ ^2 “t* • • • "f” ^riK^n )^ ~ 

is positive unless i>=:0. Aho ii-D^axid 
SO that (2.13.7) is (2.13.3). 

For equality in (2.13.6), all the roots of (2.13,5) must be equal, 
which is only possible if c^y = 0 whenever and is inde- 
pendent of ft. Hence, for equality in (2.13.7), we must have 
G^y — 0 for ft+p, (7^^ independent of ft. The last condition is 
certainly satisfied, since 0^;*= 1, and C^y — 0 is c^y=0, which is 
(2.13.4). 

We can extend the theorem to determinants with complex 
constituents by using Hermitian instead of quadratic forms. 
Further extensions have been made by Schur (2)*.* 

The following ingenious proof of (2.13.7) is due to Oppenheim^. 
Oppenheim’s argument establishes not only (2.13.7), and so 
Hadamard’s theorem, but also the inequalities (2.13.8) and 
(2.13.9) below, due to Minkowski® and Fischer‘S respectively. 

Any two positive quadratic forms may be 

reduced simultaneously, by a linear transformation of deter- 
minant unity, to sums of squares®, say where 

Cy and dy are positive. Then is reduced to 

and the determinants ... of the forms 

satisfy 

I ^ik I ~ HCy , I d^lg I = Tidy , I *4 d^^ I = n {Cy + dy). 

Hence, applsdng Theorem 10 to the sets (c^), {dy), we obtain 
(2.13.8) I I + 1 1 g I c« + d^u r». 

Suppose now that the matrix of the d is formed from that of 
the c by multiplying, first the first r rows, and then the first r 

* See also A. L. Dixon (1). 

^ Oppenheim (2). ^ Minkowski (2). 

d Eiaoher U). ® See Bdcher (1, 171). 

3-2 
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columns, by — 1.* If then we divide (2.13.8) by 2, amd raise to 
tile nth power, we obtain 

(2.13.9) lc<fc| = | 

^11 ••• ^n» 1 ^ 1 ^11 ^rr I I ^r+l,r+l ••• ^nn 1> 

where 1 | denotes the north-west diagonal minor ofjr rows 

and columns in | |, and | 1 denotes the comple- 

mentary south-east minor. Repeating the argument, replacing 
each of the factors on the right-hand side of (2.13.9) by two 
factors, and so on, we ultimately obtain (2.13.7). \ 

(iii) The modulus of a matrix. Suppose that A and B ar^ the 
matrices of n rows and columns whose elements are and 6^^; 
the elements may be complex. The matrices A +B and BA are 
defined as the matrices whose elements are 

^fiv ^fiv » ^fx2 ^2v "1“ • • • “f" ^'nv * 

31-^ If\A\ , the modulus of the matrix A , is defined by 

tkm |4 + B|<|^| + |B|, \BA\i\B\\A\. 

The first inequality is an immediate consequence of Theorem 
25 , with r= 2. The second follows from Theorem 7, since 

(iv) Maxima and minima in eUramtary geometry. We quote (as ex- 
ercises for the reader) a few of the numerous applications of the funda- 
mental inequalities to problems of elementary geometry. 

32. The area of a triangle of given perimeter 2p is a maximum if the sides 
a, 6, c are equal, 

[Apply Theorem 9 to p — a, p — 6, p — c.] 

88. If the surface of a rectangular parallelepiped is given^ the volume is 
greatest when the parallelepiped is a cube, 

[Denote the edges diverging from a comer by a, 6, c and apply Theorem 9 
to bCf ca, ab. There is an analogous theorem for a parallelepiped in n 
dimensions; if k<n, and the surface of the A;-dimensional boundary is 
given, the volume is greatest when the parallelepiped is rectangular and 
its edges are equal. This may be proved by combining Theorems 9 and 30 
with identities between determinants.] 

» Thus is formed from by replacing Xj^{i, k^z 1, 2, ... , r) 

by — — atn (and is therefore positive if is positive). 

bSeeWedWbumd). 
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84* Definition. Si la Base d'une Pyramide esl drconacriptihle d un CercU ; 
etaile Pie de la Hauteur eat an Centre de ce Oerde : PappeUe ceUe Pyramide 
droite, 

Dana une Pyramide droite toutea lea Facea ant une mime Hauteur, at aont 
egalement incliniea a/u Plan de la Baae, 

Theoremei Soient deux Pyramides de mime Hauteur, dord lea Baaea aont 
egdlea tant en Surfcuie qu'en Contour; que Vune soit droite et que VaiUre ne U 
aoU paa : fafftrme que la Surface de la premiere Pyramide eat plua petite que 
la Surface de la aeconde. [Lhuilier (1, 116 ).] 

[Let h be the height, h„ a side of the base, and the perpendictilar 
from the foot of the altitude on to Then the lateral surface of the 
second pyramid is 

i > i ViC^hh,,)^ + l^P.Kn 

by (2. 1 1.4) of Theorem 25, unless all the p„ are equal.] 

(v) Some inequaUtiea uaeful in elementary analyaia. The following 
theorems, which are easy deductions from Theorem 9, are fundamental in 
the theory of the exponential and logarithmic functions. 

35. Ifi > —m, 0<m<n, tlien 

If alao (<m, then 

\ m) nj 

36. If (>0, 0<m<n, then 

n(fi/«~l)<m(fi/»"-l). 

We have, by Theorem 9, 

m n—m 

(i+lYi - 

\ mj n \ mj n n 

If f <w», we may write —( for f. This proves Theorem 35. Theorem 36 

follows from Theorem 35 if we replace f in Theorem 36 by ■ 

2.14. Inductive proofs of the fundamental inequalities. 
Our fundamental theorems are Theorems 9, 10 (or 11), and 24 
(or 25), which we refer to shortly as G, H, M. We deduced H 
from O* and M from H; 6 is a limiting case of H, H a special 
case, or anticipation, of M. 

The simplest case of G is 

87. (Go): o“6^<o«+6/S (a+/S=l). 

We show first that G can be deduced from G, by induction. 

* Though giving also an independent proof of H. 
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Suppose that G has been proved for m letters o, h, . . . , A; (or for 
any smaller number), and that 

(X+/3+ ... + #e + A= 1, ot+]8 + ... + 

Then 

... ^* 1 ^== ( 

^ <7 -j- lA ^ uoc -(-6j3~|" ... + 1 jA+ ZA, 

by G for 2 and for m letters. There is equality in the final result 

only if a=b = ... = k, \ 

i.e. if all letters are equal. Hence G is true for m + 1 letters. 

The simplest cases of H and M are 

88. (H„): 

(t^b^ < (ctj + O2)** (^1 "i” ^2)^ (ix + ^ = 1 ). 

89. (Mo): 

{{Oi + hr + (02 + hY)^ < K' + “2’’)^ + (fcr + (r > 1 ) 
(with a reversed inequality when r < 1). We can deduce Hq from 
Gq and Mq from Hq by specialising our deductions of H from G 
and of M from H. We can also deduce H and M from Ho and Mo 
by induction, but, since these inductive proofs are not essential 
to our argument, we need only sketch them. 

(i) We have 

Oi“6i^ + 02“6/ + Oj* V ^ K + 02 )“ (^1 + * 2 )^ + 03“^8^ 

^ (®1 + 02^" Os)* (^1 + ^2 "i" ^s)^' 

The process may be repeated, and there is no difficulty in picking 
out the oases of equality. We thus obtain (2.8.3) of Theorem 13 
(H for two sets of n numbers). 

Next, if a+^+y= 1, a+j3 = (T, we have 

(a“'<'6^/«')‘'cy ^ (So“'‘'6Z»/T(Sc)5' g (Sa)“(S6)/>(Sc)v. 
This process also may be repeated, and leads to the general form 
ofH. 

We may arrange the induction differently, increasing the 
numbmr of sets first. The intermediate generalisation (H for any 
number of sets of two numbers) is worth separate statement. 
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40. If ct+^+ ...+Xsz\^then 

ai“6/ . . . Z/ + ...l/< (oj + aa)« (b^ + b^f ...(l^ + 1^)\ 
unleaa aja^ =bjb^ = ...= IJl^ or one of the sets is wutt. 

(ii) Similax'ly we can generalise Mq in two directions. On the 
one hand 

{(®1 + + Cl )’’ + (®2 + ^2 + ^ 2 )*^}^ 

< {a{ + ^ c jr + + c^yyiT 
g (a{ + af)^ + {b{ + 6/)Vr + (c/ + Cf)^, 
and on the other 

{(»! +biY+ (a2 + b^Y + (aj + b^YY^ 

S [{(a/ + a/)i/' + (6/ + bf)V^Y + ( 03 + 63 r]ifr 

S (o/ + a3*- + a3')W'+ {bf+bf+bf)Vr 

Repeating and combining these processes, we arrive at the 
general case. 

2.15. Elementary inequalities connected with Theo- 
rem 37. We can write Gq in the form 

0 “ < {oa+ 6 (1 — oc)}6“~^ 

or a“— 6“<a6““^(a — 6) (0<a<l), 

which is one case of a system of inequalities prominent in text- 
books of analysis. The complete system is stated in Theorem 41 
below. The theorem is so important that it is worth while to give 
a direct proof from first principles which conforms strictly to the 
criteria of §1.7. 

41. Ifx and y are positive and unequal, then 

(2.16.1) r«r-^{x-y)>3f-if>rf-^{x-y) (r<0orr>l), 

(2.16.2) raf~^{x—y)<3f—y'<ry'~^(x—y) (0<r<l). 

There is obviously equality when r = 0, r = 1, or a: = y. We begin 
by reducing the theorem to one of its cases. 

(i) We may suppose r positive. For let us assume that (2.16.1) 
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has been proved when r>l, and that r<0, r = —a, so that 
«+l>l. Then 

of —y^=s XT* — y-* = x~^y-*-^ — afy) 

= a;-* —x^{y — x)}> «®* (« — x) 

= ry-i(a:- 2 /). ( 

The other inequality in (2.16.1) can be treated similarly.) 

(ii) Let us denote the left- and right-hand inequalities in 
(2.16.1) by (lo) and (16) respectively, and similarly for (2i^l6.2). 
If we interchange x and y, (lb) and (2 6) become (1 a) and (2 a). 
It is therefore sufficient to prove (1 b) and (2 6). 

(iii) We may now suppose, on grounds of homogeneity, that 
y=l. 

The proof of Theorem 41 is now reduced to that of the next 
theorem. 


42. If X is positive and not equal to 1, then 

(2.16.3) af— l>r(a;— 1) (r>l), 

(2.16.4) a?'— 1 <r(a:— 1) (0<r<l). 

If in (2.16.3) we write r=l/s and a;=y^=y*, it becomes 

(2.16.4) with y, s for x, r. It is therefore sufficient to prove 
(2.16.3). 

If q is an integer greater than 1,» and y>l, then 
> 1+ y -t- . . . 4- = -— j- > ?• 

If 0 < y < 1, the inequalities are reversed. Replacing y« by x, we 
obtain in either case 

(2.16.6) ^ ^ <g'(xW«_ i)<a:_i. 

% 

Next, we have 


^ — ?— (qf«« — y«-i — «« 

q+1 q q(q+l)^^^ 

(y-i) 


?(3+l) 


- 1 ) 

+ (y«-^ + y«-*) + . • • + -f- y«-* -h . . . + 1 )}. 


* We abandon here our usual convention oonoeming the meanings of q and p. 
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The curly bracket contains iq{q+ 1) terms, all of which lie be- 
tween and 1, so that 

(2.16.6) 

and BO, if jj is any integer greater than q, 

(2.16.7) 

(y$i). 

Now it follows from (2.16.6) that 

< qHxV<i- 1)2 < (a;- 1)2 

Vu 

if * > 1, while if 0 < a: < 1 the inequalities are reversed. Hence, 
replacing y® by a: in (2.16.7), we obtain 

(2.16.8) 


p — q(x— 1)^ 

2q a:* ^ piq 


-(*- 1 ) 5 ^®*’^® (»-!)» 


(»<1). 


Suppose now that r> 1. If r is rational, we write r for pjq; 
if r is irrational, we make p/q-^r. In either case we have 

(2.16.9) 




which plainly includes (2.16.3). 

This proves Theorems 42 and 41 , but it will be useful to have 
the inequalities corresponding to (2.16.9) when r< 1. We now 
replace by a: in (2.16.7), and use (2.16.6) with q replaced by p. 
We thus obtain 


(2.16.10) 

p-q {x-l)^ ^ 
2p a:* ^ 


X— 1 — 


ifilp—1 ^p — q 

~ qip 


x(x— 1)* 


(» 51 ), 


(2.16.11) 

^ (1 - r) 1 a; - 1 - | ^ ( 1 - r ) a; (a; - 1)« (0<f < l,aJ5l). 
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We have made the proof of (2.15.3) rather more elaborate than is 
necessary, in order to obtain the ‘second order’ inequalities (2.15.6)- 
(2. 1 5. 1 1 ), which are interesting in themselves. If we are concerned only to 
prove (2.15.3), we can argue as follows. Instead of (2.15.6) we write 
simply 


ya+1-.l 


> 


yQ^l 

— » 


wheaoe 


yP_l y._l 

p 


if p and q are integers and Hence we obtain (2.15.3) for r, 

and so, by a passage to the limit \ 


for any r > 1. If now r is irrational, we may write r » a5, where a and a are 
both greater than 1 and a is rational. Then 


af — 1 «(a;*)«— 1 > a(a:*— 1)^ a«(a:— l)=r(a?— 1), 
so that (2.15.3) is true generally. 

For other proofs of Theorem 41 which satisfy the requirements, see 
Stolz and Gmeiner (1, 202-208) andPringsheim(l). Pringsheim uses the 
result to obtain an elementary proof of H. Radon (1,1351) deduces H and 
M from Theorem 41, but proves this by differential calculus. The proofs 
of Theorem 41 given in textbooks are usually limited to rational r; see 
for example Chrystal (1, 42-45), Hardy (1, 138). 


2.16. Elementary proof of Theorem 3. We have proved incidentally 
in the last section a number of inequalities sharper than those stated in 
Theorems 41 and 42. We lay no stress on these, since it is easy to find stiU 
more precise inequalities by the aid of the differential calculus (see § 4.2); 
but it may be interesting to show shortly how they enable us, if we desire, 
to ‘elementarise’ the proof of Theorem 3. 

We observe first that 

(2.16.1) a*'=l + 0(r) 

for fixed positive a and small (positive or negative) r; 

(2. 16.2) ( 1 + w)« = 1 + gt* + 0 (w*) 
for fixed q and small u; and 

(2.16.3) {1 + 0(r*)}i/*^=: 1 + 0(r) 

for small r. We leave the deduction of these formulae from those of the last 
section to the reader. 

Supposing now that r is small, we have o/ = 1 + , where = 0 (r), by 

(2.16.1), and 


= ( 1 + u^)^p = 1 + q^u^ + 0 (r*), 
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by (2.16.2). Hence 


(5 

/ OlV.. 

.a„v y/r 

_( (1+tt, )«!.., 


aw," 

V?ia/+., 

■•+?nO«V 

Ig, (1 +%) + •• 

•+«n(l+Mn)i 




•-+7nMn + 0(r2)| 

.X/, 



1 l+5iM 

1-f ... -hqn^n J 

( 


= {1 + 0 (»■»)}»= 1 + 0(r) 1. 

2.17. Tchebychef’s inequality. We know (Theorem 24) that 
a)l^(a + 6) is comparable (§ 1.6) with 9Jl^(a) + 9Kr(*)- natural 
to ask whether {ah) is comparable with (a) 9R;. (6). Theorem 
43 below shows that this is not so. 

We say that [a) and (6) are similarly ordered if 

for all /X, V, and oppositely ordered if the inequality is always 
reversed. It is evident that (a) and (6) are similarly ordered if 
there is a permutation Vj, of the suffixes such that 

^vi» •••> •••> ^vn non-dccrcasing 

sequences, and oppositely ordered if ... is non-increasing 
and by^, ... non -decreasing; and that the converses of these pro- 
positions are also true. 

43.* // r > 0, and (a) and (6) are similarly ordered^ then 
(2.17.1) 2R,(a)3R,(6) < SR,(a6), 

unless all (he a or all the b are equal. The inequality is reversed when 
the sets are oppositely ordered. 

It is enough, after (2.2.7), to consider the case r= 1. Then 
Tip Tpab — Tpa Tpb = Tp^ TpyUyby — Tp^a^ ^Pyby 

== SS {p^pyf^pby p^pyO^by) ^TT {PvPfi^fj^b^ PvPfi^v^fji) 

= i TT (PfiPyayby'^p^Pya^by-\‘Pyp^a^b^’^PyPnayb^) 

= iTTp^Py{a^-ay) (6^-6„) ^ 0, 

or 

Sl(a)Sl(6)g2l(a6), 

if the series are similarly ordered. 

We can determine the cases of equality as follows. Suppose, as 

• The integral analogue is due to Tchebyohef. See Herxnite (1, 46-47), Franklin (1), 
Jensen (1), and Theorem 236. When r the inequality holds for any real 

and aimiiarly ordered o, 6. 
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we may in virtue of the remarks earlier in the section, that (a) 
and (6) are non-decreasing. The double sum contains a term 

and this can vanish only if all a or all 6 are equal. 

An immediate deduction is that 

2K,(a)9R,(6)...9K,{l)<9R,(o6...1) i 

if r is positive and (a), (6), (1) are all similarly ordeW; in 

particular 3R,(a)<2K,„,(o) \ 

if m is an integer greater than 1. This includes Theorem 6 aind is 
included in Theorem 16. 

The question asked at the beginning of this section is included 
in the more general question settled by the next theorem. 

44. A necessary and sufficient condition that Wlridb ...1) and 
9Jl^(a)2ll^(6) ... 9(yi«(Z), where r, u are positive^ should be com- 
parable, is that 

(2.17.2) + 

r 8 t V 

in which case 

(2.17.3) SK,(a6...Z)^9R,(o)aR,(6)...2R„(l). 

The sufficiency of the condition follows at once from Theorems 
12 and 16. If we take every set in (2.17.3) to be (1, 0 , 0 , ..., 0), we 
see at once that (2.17.2) must be satisfied. A general inequality 
opposite to (2.17.3) is impossible for any r, a, ..., since a^h^...ly 
may vanish for every v and yet the right-hand side be positive. 

2.18. Muirhead’s theorem. In this and the four succeed- 
ing sections we suppose the a to be strictly positive. We denote by 
S!l’(ai,02,...,aJ 

the sum of the n! terms obtained from Fia^, a, a„) by the 

possible permutations of the a. We shall be concerned only with 
the special case 

J’(Oi,aj,...,o„) = ai“i02“«...o„“» (o„>0, a„^0). 

We write 

[a] = [«! , a* , . . ., a J = i S ! . . . o„“« . 
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It is plain that [a] is unaltered by any permutation of the a, 
so that we may regard two sets of a as the same if they difEer 
only in arrangement. We may describe a mean value of the type 
[a] as a symmetrical mean. 

In particular 

[ 1 , 0 , 0 , .,., 0 ]= ••• 

n ! 

the arithmetic and geometric means with unit weights. When 
ai + ag -f . . . + = 1 , [a] is a common generalisation of 81 (a) and 
@(a). 

In general [a'] is not comparable with [a] in the sense of § 1.6. 
The problem solved in this and the next two sections is that of 
determining conditions for comparability. 

We say that (a') is majoriaed by (a), and write 

(a')-<(a), 

when the (a) and (a') can be arranged so as to satisfy the fol- 
lowing three conditions: 

(2.18.1) ai' + oc2'-h ... + a/ = ai + a2+ ... + a^; 

(2.18.2) ... ^a„', ... 

(2.18.3) + a2^ 4" . .. “h oc/ ^ ai + ocgH- ... + a,; {l^v<n). 
The second condition is in itself no restriction, since we may 
rearrange (a') and (a) in any order, but it is essential to the state- 
ment of the third. It is plain that (a)-<(a). 

46. A necessary and sufficient condition that [a'] should be 
comparable with [a], for all positive values of the a, is that one of 
(a') and (a) should be majoriaed by the other. If (a') -<(a) then 

(2.18.4) ^ [a]^[a]. 

There is equality only when (a') and (a) are identical or when all 
the a are equals. 

» Theorem 45 u due substantially to Muirhead (2); but Muirhead considers only 
integral «. 
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2.19. Proof of Muirhead’s theorem. (1) The condition is 
necessary. Suppose, as we may, that (2.18.2) is satisfied, and that 
(2.18.4) holds for all positive a. Taking all the a equal to x, we 
obtain 

This can only be true both for large and for small x if (^.18.1) is 
true. 

Next, take 


Cb-y ~ fltrt — • ■ 


-^v+l 


— ... — — 1, 


X being large. Since (a') and (a) are in descending ord^r, the 
indices of the highest powers of x in [a'] and [a] are 

ai^ + a2^+ ••• “f oc/, a2 + oc2*4‘ •.. + 
respectively. It is plain that the first cannot exceed the second, 
and this proves (2.18.3). 

(2) The condition is sufficient. The proof of this is rather more 
troublesome, and we require a new definition and two lemmas. 

We define a special type of linear transformation of the a, 
which we call a transformation T, as follows. Suppose that 
and oLi are two unequal a, the first being the greater; we may write 

(2.19.1) = + OLi^p — T (0<T^p). 

If now 

(2.19.2) 0^a<Tg/> 
then a transformation T is defined by 


T + cr 


a* =/» + <"= "IT **=+-27 


«/» 


(2.19.3) 


«/ =p-o = 




T + a 


2t ' 2t 
(v+fc, v+Z). 




If (a') arises from (a) by a transformation T, we write 
The definition does not necessarily imply that either the a or 
the a' are in decreasing order. 

It is plain that the sufficiency of our condition for oompara- 
biliiy will be established, and that we shall also have proved what 
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is stated in Theorem 45 about the case of equality, if we have 
proved the two following lemmas. 

Lemma 1. // a'^Ta then [a]^[a], with equality only when 
all a are equal. 

Lemma 2. If (a')-<(a), but (a') ia not identical with (a), then 
(a') can be derived from (a) by the successive application of a finite 
number of transformations T. 

Proof of Lemma 1. We may rearrange (a) and (a') so that 
1, Z=:2. Then 

(2.19.4) n!2[a]-.n!2[a'] 

= ?i!2[p-f T,p-T,a3,...]-?^!2[>-f a,p~cr,a3,...] 

= S ! (a/+r ^ ^^p+T « ^^p+a^^^p-a _ 

= 2 ! {a^a^)P-^a^^^ . . . a^^ a2^+^) ^ 0, 

with equality only when all the a are equal. 

Proof of Lemma 2. We suppose that the condition (2.18.2) is 
satisfied, and call the number of the difierences — which 
are not zero the discrepancy of (a) and (a'); if the discrepancy 
is zero the sets are identical. We prove the lemma by induction, 
assuming it to be true when the discrepancy is less than r and 
proving that it is then true when the discrepancy is r. 

Suppose then that (a')-<(a) and that the discrepancy is r >0. 
Since, by (2. 18. 1 ), 2 (a,, - af) = 0, and not all of these differences 
are zero, there must be positive and negative differences; and, 
by (2.18.3) the first which is not zero must be positive. We can 
therefore find h and I so that 

(2.19.5) cLj/ jfc-f-i ~ > •••> 0C| >aj.® 

We take a^^p + r, as in (2.19.1), and define a by 

(2.19.6) a = Max(|a*'-p|,|a/-p|). 

Then 0<Ti p, since ajfc>a/. Also one or other*^ of 

a/-p=-c7, a^'~p = or, 

• aj - aj' is the first negative difference, - ctj/ the last positive difference which 
precedes it. The text assumes i— ife> 1; the case Z— 1 ia easier. 

^ Possibly both. 
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is true, sinoe g ; and a <r, since a*' < aj^ and a/ > Of . Hence 

0^a<T^p, 

as in (2.19.2). 

We now write 

(2.19.7) aj" = /B + o', «/' = /) — or, a/' = oty (v + ifc, v + i). 

If OLii'—p=a, afc" = afc'; if a.{ —p — — a, aj" = aj'.“ Sincel the pairs 
a/c' and aj, o^' each contribute a unit to the discWancy r 
between (a') and (a), the discrepancy between (a') am (a") is 
smailler, being r— 1 or r — 2. \ 

Next, comparing (2.19.7) with (2.19.3), and observing that 

(2.19.2) is satisfied, we see that (a") arises from (a) by a trans- 
formation T. 

Finally, (a') is majorised by (a"). To prove this we must verify 
that the conditions corresponding to (2.18.1), (2.18.2) and 

(2.18.3) , with a" for a, are satisfied. For the first, we have 

(2.19.8) a*" + a/' = 2p = ajfc4-a„ Sa' = Sa = Sa". , 

For the second, we observe first that 

ip+ 1 P 1 ^p + <^=afc''» 
a,'^p-|a/-p|Sp-a = a," 

and so, by (2.19.5), 

«"s-i = «*-! ^ a* = p + T >p + <7 = a"fc ^ a'fc S a Vi = «fc+i = «"s+i . 

«"f-i = “i-i = “'<-1 ^ a'j S «"< = P - >P - T = «, & = a' Vi; 

and the inequalities affecting the a" are those required. Finally, 
we have to prove that 

di + aj* + . . . -|- ^ cci” -t- • . . -I- dy”. 

Now this is true if v<i: or by (2.19.7) and (2.18.3); it is 
true for v=k, because it is true for v= ft— 1 and a^' ;g a*"; and it 
is true for ft < v < 2 because it is true for v=k and the intervening 
d' and a" are identical. 

We have thus proved that (a') is majorised by (a"), a set arising 
from (a) by a transformation T and having a discrepancy from 
(a') less than r. This proves Lemma 2 and so completes the proof 
of Theorem 45.*> 

* Again, both these equations may be true, 

^ For another proof, see Theorems 74 and 75. 
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2^. An alternative theorem. We shall say that (a') is an 
average of (a) if there are numbers such that 

(2.20.1) p^,^0, S S ^ =1 

fi-i V— 1 

and 

(2.20.2) «/=^Vi« 1 +I>m 2 * 2 + — +?v»«n* 

Since the conditions (2.20.1) are not affected by a permutation 
of the /Li or the v, the definition is, like that of § 2.18, independent 
of the order of the a or the a'. The equations (2.19.3) show that 
(p + a, p — a, ag, ...) is an average of (p + r, p — t, (Xg, ...) when 

(2.19.2) is satisfied. 

The last two conditions (2.20.1) may also be stated as follows: 
Sa', when expressed as a function of the a, is identical with Sa, 
and every a' is 1 if every a is 1. Prom this it follows that the 
relationship is transitive; if (a') is an average of (a), and (a") of 
(a'), then (a") is an average of (a). And from this and Lemma 2 
of § 2. 19 it follows that if (a') -< (a) then (a') is an average of (a). 
The converse is also true. For suppose that (2.20.1) and 

(2.20.2) are satisfied. Then (2.18.1) follows by addition of the 
equations (2.20.2). Finally, if we suppose ((x) and (a') in descend- 
ing order, and write 

we have and 2)!:^ = wi, by (2.20.1); and so 

ai' + aa' + . . . -1- ^ fci ai + . . . + %rn-i + (m ~ ^ ~ 

which is (2.18.3). 

We have therefore proved the two following theorems. 

46. A necessary and sufficient condition that (a') should be an 
average of (a) is that (a') •<(»). 

47. A necessary and suffjicient condition that [a'] should be com- 
parable with [a] is that one of (a') and (a) should be an average of the 
other. If (at!) is an average of (a) then [a'] ^ [a], with equality only 
as in Theorem 45. 

2.21 . Further theorems on symmetrical means . ( 1 ) Theo- 
rems 45 and 47 fulfil two purposes. First, either theorem gives a 
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simple oriterion for deciding whether two means [a] and [a'] are 
or are not comparable. Secondly, the proof of Theorem 45 shows 
us how, by repeated application of the transformation (2.10.3) 
and the formula (2.10.4), to decompose the difference of two 
comparable means into a sum of obviously positive terms. We 
obtain, for example, a new and interesting proof of the tiieorem 
of the arithmetic and geometric means (with imit weights); in fact 

a (a«) - (a«) = [», 0. 0 0] - [1, 1, .... 1] 

= ([n,0,0,...,0]-[ft-l,l,0,...,0]) 

+ ([?i-l,l,0,...,0]-[»-2, 1,1,0 0]) 

+ ([n-2, l,l,0,...,0]-[»-3, 1, 1, 1,0,...,0]) + ... 

~ 2(»I) ^ ~ (®i ~ ®a) + ^ ~ Oa””®) (oi — Oa) ®8 

+ S ! (Oi"-* - 02»-») (tti - Cj) ajO* +...}. 

Since (a, — a,) (a/ —a/)>0 

unless o, = o, , the theorem follows*. 


(2)48. jy ai + ocgH- ... +a„= 1, 

@(a)<[a]<a(a), 

unless [a] is @ (a) or a (a), or all the a are equal. 

This theorem** shows that all the [a] of degree of homogeneity 1 
are comparable with @ (a) and a (a), though not in general com- 
parable among themselves. To prove it we apply Theorem 47; 


since 


1 

n 


a. a, 
n n 



and a^ = a^.l-|-a^+i.0 + ...-|-«„.0-l-ai.0-t-...-|-a^_,.0, 

(1/n, l/», ..., l/n) is an average of (a) and (a) an average of 
(1, 0, ..., 0). Or we may deduce Theorem 48 directly from 
Theorem 45. 


(3) We add two ftirther theorems of a similar oharaoter, with indications 
only of the proofs. 

49. ff 0<a^ I, then a necessary and sufficient condition that [a'] ^ [oc]*^ 
is that (aO-<(o'a). 1/ a > 1, the condition is necessary but not sufficient, 

A This proof was known before Muirhead’s work; see Hurwitz (1). 
b Commnnioated to na by Prof. I. Sohnr. 



ELEMENTARY MEAN VALUES 61 

[To prove the condition necessary, follow the line of § 2. 19 ( 1). To prove 
it suf&cient, combine Theorems 45 and 11. As an example 
[r, 0, 0, . . .] g [«, 0, 0, .. .]*•/• {0<r<8); 

this is Sll,.(a) < iDl«(a) (Theorem 16), with unit weights. The same example 
shows that the condition is no longer sufficient when a> 1.] 

50. If r,p, a are positive and 

r,. = Sp^a/=V 

(in the notation of § 2. 10 (iv)), then a necessary and sufficient condition that 

TaHTat ••• Tajt . . . Ton 

for all a and p is that (a')<(a). 

[The necessity of the condition may be established as before. To prove 
it sufficient we use Theorem 46 and Holder’s inequality, which give 

rp /p 

*fii ®x+ Va “a**"’ * * V"» “*» 
^(ra,)Vi(T^)Va...(T.jV«: 

we have changed the notation slightly in order to avoid conflict with 
that of § 2. 10. The result follows by multiplication.] 


2.22. The elementary symmetric functions of n positive 
numbers. If 


(a:+Oi) («+«,)... (a:+a„) = a!"+Cia:“-^ + Caa;“-*+... + c„ 

= *” + ( j ) + (2) + . . . +3J„ , 

then Cy is the rth elementary symmetric function of the a, i.e. the 
sum of the products, r at a time, of different a, and the average 
of these products. In this section we consider two well-known 
theorems concerning thep^. We write Co=j?o= 

In the notation of § 2.18 


‘=r = fT^7)lS!Oia,...Or. 


1.0.0 0], 


there being r I’e and n—r O’s. Also 25i=8l(a) and p„— ©"(a), 
with unit weights. The different p,, being of different degrees, 
are not comparable*; but they are connected by non-linear 
inequalities. 


• This is a trivial case of Theorem 45. 
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61« Pr^iPr-{‘i<P^ (l^r<n),unl^8 all the a a/re equal. 

52. > jP 2 * > 3^8* > • • • > Pn^^i urdeas all the a are equal. 

Theorem 51 , which was stated by Newton®, is actually true for 
real, not necessarily positive, a; and we shall give a proof of the 
more general theorem, depending on the methods of the di/fferen- 
tial calculus, in § 4.3. Theorem 52 is due to Maclaurin^. 1 
Theorem 52 is a corollary of Theorem 51 , since \ 

(PoPa) (PiPa)*{PaPa)^- (Pr-iPr+iY <Pi^PaW -’P/^ \, 
gives or \ 

This remark, together with the proof of §4.3, disposes of the 
theorems, but it is interesting to consider proofs of them by the 
methods of this chapter. 

(i) Proof of Theorem 52 by the method of §2.6 (iii). We begin 
by proving a theorem similar to but weaker than Theorem 51 . 

53. ® 

This theorem is weaker than Theorem 5 1 , since p,._i < p,.* is 

(r+l)(»-r+l)^ ^ 

To prove it we observe that a typical term in is 

a^a^ , . . , . . a^.^^ 

and that this occurs with the coefficient 




From Theorem 53 it follows that 


( 2 . 22 . 1 ) 

if r<«. 




• Newton (1, 173). See also Maolaurin (2). 

^ liadaurin (2). See also Sohldmilch (1). The inequality Pi>Pn^^ » ^ of 
Theorem 9. 

0 The theorem ia atoted, like Theorema 61 and 62, for poaitiTe a. It remains 
true, aa the proof ahowa, for non-negative a, nnleaa C|.«0 (i.e. unleaa all hat r — 1 
of the a are 0). 
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We can now prove Theorem 52 as follows. If the a are not all 
equal, let Min a, a, = Max a. Then 

(2.22.2) 0i<ai<02, 

where 

We replace and <*2 by ai and ol^, choosing ol^ so that shall 
be unaltered, and prove that any p„ for which v> pis increased 
by the substitution. The result will then follow as in § 2.6 (iii). 
We have 

0 “ «1«2C^-2 + («1 + “ 2 ) + c; , 

where is the formed from the n — 2 numbers other than 
and Uj . Since is to be unchanged 

(2.22.3) {aia2-aia2)c'^_2= + 

(2.22.4) (aiC^_2“^^/u-l) 

The value of ag defined by (2.22.4) is positive because of (2.22.2). 
Also, if becomes p*, 

(”) (l>? - = («! “2 - «x «a) c ;-2 + (ai + aa - Oi - Oj) c;_a; 

and so p*—Py has the sign of 

(*i + *2“®i“®2)(> Jr )• 

\Cv-2 V- 2 / 

The second factor is negative, by (2.22.1) ; and, by (2.22.3), 

spi (*i ®a — ®i ~ ^a) ~ (®i®a *i *a) 

= sgn {ai (aj + aa - Oi - ^ 2 ) + OiOa — *1 * 2 } 

= sgn {(aj - Oi) (ai - Oj)} = - 1, 
by ( 2 . 22 . 2 ). Hence p* >Py, which proves the theorem. 

(ii) Proof of Theorem Slby induction^. Suppose that Theorem 
51 has been proved forn — 1 numbers, Oj, a,, ..., a»_i. and that 

• This proof was oommuzuoated to os indepezidently by MesaiB A. L. Dixon, 
X. E. JoUiiie, and M. H. A. Newman. 
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c', pI are the c,., p^ formed from these n— 1 numbers, whioh we 
shall at first suppose not all equal. Then 

and so Pr = ^Pr+ lf^nPr-l> 

Hence we deduce 

iPr-i Pr+i-Pr^) =A + Ba„ + Ca^*, 
where A = {{n-r)’^-l}p’^_iP'^^T^-(n-r)^p’/, 

S = (« - r + 1 ) (r + 1 ) p;_i !>; + (n - r - 1 ) (r - 1 ) i);_8 j);+i 

-2rin-r)pl_ip'^, 

c^(r^-i)p;_,p;-r‘p;i,. 

Since Oj, Sj, . . . , a„_i are not all equal, we have, by the inductive 
hypothesis, 

p;_lP;+l<p;2, p;_2Pr<K®-l. Pr-iPUl<Pr-lPr> 
so that A<—pl.^, jB<2p'_jp', CK-p'^t^, 

and »®(2>r-ii>f+i-J>?)< 

This proves the theorem. The result is still true when 

••• =®n-l> 

because then o„ + Oi=p'/p'_i. 


It is also possible to prove Theorem 51 by means of identities of the 
type considered in $2.21(1). 

64 - Pr*-Pr-lPr+l = 

'(r+1) 

where (r, <) = 2<j j* . . . 

the eummation extending over aU prodMUe formed from the a and of the type 
shown. 


C) (4.) 


r-1 

S 

i=0 


(2i\ (Tjj) 

\i) 1+1’ 


+ (T:^5^)S(«x-a.)*(a.-a.)«(o»r‘)‘ 


3l(n-5) 


^ (r-l)(r-2)(n-r-l)(n- 
+ ..•* 


Zr ~ 2) ^ "" ^ 


ujftere /he sum of the products, r^^lcUa time, qfihen^2a other thorn 

a^, a^f and so on. 
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Theorem 54 is due to Muirhead (1), and Theorem 55 to JolKffe (1). 
Theorem55 gives an ‘intuitive’ proof of the more general form of Theorem 
51 (for real a of either sign) referred to at the beginning of this section. 

2.23. A note on definite forms. The identity of Hurwitz 
and Muirhead proved in § 2.21 (1) shows that, when a > 0, 

can be expressed as a sum in which every term is obviously 
non-negative. 

If we write , we obtain 

(2.23. 1 ) -h . . . H- x^'^ — nx^x^ 

“ F (n ~ -r)i ~ + •••}• 

Now 

= (*x®"-*+a:j2»-«a:a®+ ... + a;8*"-*) 

is a sum of squares of polynomials such as {x-^ — x^) arj”-*; and so 
the right-hand side of (2.23.1) is a siun of squares. Finally, since 

rf ” -f- . . . -f- a:|« - *2 . . . a;2„ = a;j« -f- . . . *1" - nx^ ...x^ 

-H j + . . . + a:|» - . . . a;|„ + n (Xj . . . a;„ - . *2 J*, 

it follows that 

(2.23.2) F = Xi^™ -f . . . -j- X2„®™ - 27 iXiX 2 . . . a:2„ = SP,*, 

i 

where the are real polynomials of degree n. For example 

3^+y^ + z^+u^+tfi+w^ — 6xyz‘Uvw 

= \{x^+y^-\- z^) {(y® — z®)® + (z® — x®)® + (x® — y®)®} 

-t- |(w® H- »® -f- w®) {(w® — ta®)® + (w® — M®)® -I- (a® — w®)®} + 3 (xyz — uwo)® 

is a sum of 9 -f- 9 -f- 1 = 19 squares of real polynomials. 

A real form is a homogeneous polynomial P (x^ , X 2 , . . . , x„), with 
real coefficients, in the m real variables x^ , Xj , . . . , x„ . A form F is 
said to be definite, in a certain region of the variables, if it does not 
change sign in that region, for example if P ^ 0. We may divide 
definite forms into positive and negative forms, and it is plainly 
sufficient to consider positive forms. Thus the form (2.23.2) is 
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positive in ttie region of all real values of the variables. It is plain 
that a form which has this property must be of even degree. 

If > 0 in a certain region, then F is said to be strictly positive 
in that region. 

The form (2.23.2), and the forms considered in Theorems 7 
and 55 (with x for a), can be expressed as sums of squar^ of real 
polynomials, and it is natural to ask whether this is algeneral 
property of definite forms. Is it true that, if F> Ofor all re^ x, then 

i^ = SP<2, \ 

^ \ 

where the are real polynomials ? ' 

This problem was solved completely by Hilbert^. Here we have 
space only for a few fragmentary remarks. We begin by observing 
that there are two cases in which the answer is immediate. We 
denote the degree of F by 2n, and the number of variables by m. 

If m == 2, so that F — F{x, y) and n is arbitrary, then any real 
factor orc + fty of F must occur in even multiplicity, and the 
complex factors must occur in conjugate pairs ax~\-by, da;-f 
Hence, grouping the factors appropriately, we obtain 
P (g ^ ir) {q ~ ir) = (pqY + {prY, 

where p, q, r are real polynomials. 

It is a familiar theorem of algebra*> that any definite quadratic 
form in any number m of variables may be expressed as a sum of 
at most m squares of real linear forms. Thus the answer is affirma- 
tive in the two cases 

(1) m — 2,n arbitrary, 

(2) m arbitrary, 2w = 2. 

Hilbert found a third case 

(3) m = 3, 2n = 4, 

and proved that any positive biquadratic form in three variables 
is representable as the sum of three squares of real quadratics. 
He also proved that in all other cases the answer is negative, 
there being definite forms of degree 2n in m variables which can- 
not be represented in the manner proposed. 

* Hilbert (1). ^ See, for example, B6oher (1, 144-154). 
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Hilbert then suggested the following theorem: any positive F 
can be expressed as 2 

i * * 

where is a real rational function. An equivalent theorem is: any 
positive F may be expressed as a quotient of sums of squares of real 
forms^. 

Hilbert ^ gave a very diflGLcult proof of these theorems for ternary 
forms in {x, y, z). The general theorems were first proved by 
Artin®. Artin’s proof is very remarkable and comparatively 
simple, but depends upon the ideas of modem abstract algebra 
in a manner which makes it impossible for us to reproduce it here. 


2.24. A theorem concerning strictly positive forms. The 
rather fragmentary remarks of § 2.23 form a natural introduction 
to the simpler problem which we consider here. We are concerned 
now with forms which are strictly positive in the region of posi- 
tive X. The theorem which we shall prove resembles those of 
§2.23 in asserting that a positive form can be represented in a 
manner which renders its positive character intuitive. It is no 
longer necessary that the degree of the form should be even. 

56.^ I f the form F {x^, x^y x^) is strictly positive for 
x^O, Sa; > 0, 
then F may be expressed as 



where 0 and H are forms with positive coefficients. In particular 
we may suppose that 

H == (a?! + + • • • + x^)P 

for a suitable p. 

^ It is evident that the first theorem implies the second (with one square only in 
the denominator). And since 


the second theorem implies the first. 

*» Hilbert (2). cArtin(l). 

^ P61ya (3). The theorem had been proved before (apart from the last clause) by 
Poincar^ (1) when w— 2 and by Meissner (1) when Meissner’s method is 

applicable in principle in the general casOf but does not lead to so simple a result. 
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For simplicity of writing we suppose m=3; no new point of 
principle arises for general m. 

The function F (x, y, g) is positive and continuous in the closed 
region 

(2.24.1) *^0, z^O, x+y+z=^l, 

and has a positive minim u m n in that region. We write 

(2.24.2) F(x,y,z) = 'Z^A^y^^^~, 
the summation being over 

(2.24.3) oc ^ oc+^4-y=w5 ^ 


and 

(2.24.4) 




where ^>0 and 
that 



are the usual binomial coefBioients, so 



and — 

\ a / 1.2.3... a 

for a = 1, 2, 3 

It is plain that <f>{x,y,z;t)-*F {x, y, z) 
when and if we write 

^(x,y,z;0) = F{x,y,z), 
then ^ is continuous in 

®^0, ySO, z^O, x+y + z—1, 
There is therefore an e such that 


(2.24.6) 

^{x,y,z\t)>^[x,y,z\Q)-\n=‘F(x,y,z)-^lt^\li>0 
for 0 < i < 6 and aU ie, y.zin (2.24.1). 

We have also 

(2.24.6) (a!+y+2)*'-”=(*-»)ISjfe-„^|^, 

the summation being over 

K^O, A^O, /a^O, #c + A+/x = i: — ». 
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Multiplying (2.24.2) and (2.24.6), we obtain 
Here we write 

a + #c = a, jS+A = 6, y+/t=c, 
so that a, h, c vary over 

(2.24.7) o^O, 6^0, c>0, a+b + c^k, 
and a, j8, y over 

(2.24.8) OSaSa, OgjSgft, 0gy<c, a+j3+y = n. 

This gives 

(2.24.9) 

G) (;) • 

In (2.24.9), S' implies summation with respect to a, j8, y over 
(2.24.8); but, since = (o) = 0, ... ifa>a, )5>6, ...,wemay 


3 

replace this summation by summation over (2.24.3), i.e. by S„. 
We thus obtain 


(2.24.10) 

(a:+y+z)*-«^’=(&-,^)!S,U|^S„^ 


aPy 


G) 




The ff> here is positive, by (2.24.6), if h is sufficiently large, and 
this proves the theorem. 


( 1 ) The theorem gives a systematic process for deciding whether a given 
form F is strictly positive for positive x. We multiply repeatedly by So?, 
and, if the form is positive, we shall sooner or later obtain a form with 
positive coefficients. 

It is instructive to consider the working of the process for 
= rcj** + a;," + . . . + rTn** - (n - c) , 

where c is positive and small. The coefficient of 

where ♦!+<,+ . ..+»«=n(j'+ 1), in 
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is certainly positive if one of the f is 0. it is 

. (ng)! {nq)\ 

(h~n)!ta!t3!...^ti!(H-n)lt3!... l)!(t,~ 1) !../ 

and has the sign of 

+ij (tj- 1) ... (t,-n+ 1) + ... -(n-€)tif3 ... 

We require ^ to be positive f or all , . . . , , 

If not all of ^3, ... are equal to 1> there is one» say less tlWig+ 1 
and one, say tg, greater than g + 1; and changing ii, into ti+\L» 1 
changes 0 by \ 

-(n-€)t8...i„(t3-ii-l)W0. 

Hence ^ will be positive for all i if it is positive when every t is g + 1. 
It will be positive in this case if 

w(g+i)g(g-i)...(g-n+2)>(n-€)(g+i)" 


or 




and a fortiori^ if g+ 1 > — • 

If this condition is satisfied, all the coefficients in ^ are positive. 

It follows that jP > 0 for « > 0, La? > 0. Making c -► 0, we obtain yet 
another proof of the theorem of the means, in the form La?** ^ nlla?. 

(2) Ifwewrite a?«= l~a?i-...-a?,„_i 

we obtain a theorem concerning general non-homogoneous pol3momial6 
in w — 1 variables. 


57. If a (non-homogeneoua) polynomial / (a?i , x, , . . . , x^^i) is poeiUve in 

theregton »«_iS0, + 

then f{x) can he expressed in ike form 

m = Eca;i“‘ . . . C-i (* - - • • • - 

where the a are non-negcMve integers and the c are positive. 

The theorem is a generalisation of one due to Hausdorff 


MISCELLANEOUS THEOREMS AND EXAMPLES* 

58. If a,p.Y A are greater than -> 1, and are all positive or all 

negative, then + + + 

[For the case a = j8 = . . . = A, see J ames Bernoulli (1,5, 112).] 

» See Theorem 58. ^ Hausdorff (1). Hausdorffhas m=2. 

0 Some of the theorems which follow here are mere exeroises for the reader, but 
most have some independent interest. 
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59. If oO then 

|o + 61*S(l + c)lo|>+ (^l + ^j|6|» 
for all (real or complex) a, 6. 

[See Bohr (1, 78).] 

60. If 3ln • C5n arithmetic and geometric means of , a, , . . . , 

with unit weights, and 3l«+i, ©n+i those of then 

n (^n - ©«) < (n + 1) (3ln+i - ®«+i) 
imless a„+i=:®„. 

[This theorem, commimicated to us by Dr R. Rado, embodies another 
proof of the theorem of the means. If we write a„+i=a 5 *‘+^, (5n=l/***^S 
then the inequality to be proved is 

a;n+i — (n + l)xy*^ 4- > 0 

and this follows from Theorem 41.] 

61. + p (a>0, 5>0, r>l), 
with equality when 6 = a*“~^. 

[Another form of Theorem 37. For this and the next two theorems see 
Young (1, 5, 6).] 

62. + (u>0, v> ~l/p, p>0). 

[Replace r in Theorem 61 by 1 and a, b by u, (1 4*iw)/(l -f p).] 

63. tcv^iAlogtt + e^^ (w>0). 

[Make p 0 in Theorem 62. See also § 4.4(5).] 

64. If o>0, 0 i 02 ...a„ = i’*, then 

( 1 + Oj) ( 1 + Oa) • ‘ . ( 1 + O'n) > ( 1 + 
unless all the a are equal. 

[Chrystal (1, 61). Example of Theorem 40.] 

65. If a and b are positive and p > 1 or p < 0, then 

^ (Sa)^ 

unless (o) and (6) axe proportional. The inequality is reversed if 0 <p < 1. 
[Radon (1, 1351): transformation of Theorem 13.] 

66. If a> 0 then So So“^>n>, unless all the o are equal. 

[Prom Theorems 7 or 9 or 16 or 43.] 

67. S(a + 6)S^<Sar6. 

unless (a) and (6) are proportional. 

[Milne (1).] 
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unless (a), (6), (c) are proportional. 


69. If0<r<saiid 

then SM y (a, 6) (a, b) < 21M, (o, 6) LJlf^ (a, h) < Sa S6, 

unless (a) and (5) are proportionaL 

70. If0<i5:<land 

Ildb^A(Lb^yl^^ . 

for all b, then Sa*® ^ A^. \ 

[This is an analogue of Theorem 15 for the ease 0 < 1; < 1 (when < 0). 
If all a are positive, define b by 

ab=a^, 6=a*~^ b^'=a^, 

when 


(i) La*>il». 

If all a vanish, A must be 0 and there is nothing to prove. If some but 
not all vanish, suppose that a> 0 in a set jB? of ^ members and a=0 in 
the complementary set CJBJ of v=:n— ft members; and define 6 as above 
in JSJ, and by 6 = in OB. Then 

La* = La6 > ^ (La* + = A (La* + 

IS 

Making G-^oo we obtain (i) again.] 


71. If0<h^a^^H,0<kSby^K,then 
,La*L6» 


1^- 




[See P61ya and Szego (1, i, 67, 213), where the conditions for equality 
are given.] 

72. lim 

^ 2Rj;(a) 

If all the a aro positive, there is a similar theorem for — oo. 

the means being formed with unit weights, then 


where ( is the negative and the positive root of the equation 
(l+»)e~*=(l-c)^ 

[See P61ya and Szego (1, i, 68, 216).] 
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74. If [yi » ^ [yi» •••> y*]* [Si% 8/] ^ [8^, 8j] 
for all values of the variables involved, then 

[yi 9 •••» y* » 8j, Sj], 

[It follows from the first hypothesis, and the definition of the sums, that 
[yi > •••>yjl:^Sl]^[yl» •••>yife>8i]; 
and hence, by repetition of the argument, that 
(i) lyi* •••>yfc^8i> 

Similarly, using the second h5rpothesis, 

(i^) ivif •••9yk9^i9 •••» V]^[y/, 8i, ...,8j]. 

The result follows from (i) and (ii).] 

75 . If (a')*<(a) and the a and a' are in decreasing order, then there is 
a greatest non-negative 8 for which 

(i) (i3) = (a/ -J- 8, a,', . . . , a Vi . - 8) < (a). 

If 8 has this value then 

(ii) {0^1fO<{pL9pn) 
and 

(iii) {Pl9 Pk)’^{^l9 •••9fX-k)9 •••> (**+!» •••» tt») 

for some h between 1 and n — 1 inclusive. 

[It is plain from the definitions (a) that (i) is true for 8 = 0, (6) that the 
set of 8 for which it is true is closed, (c) that, if it is true for a positive 8, 
then it is true for any smaller positive 8. Hence there is a ma iciTniiTy) 
non-negative 8 for which (i) and (ii) are true. 

If 8 has this value then either (a) = a„' — 8 = 0, or 

(6) ft + • • • + = OLi -h . . • + Cf>k + S OCjL + • • • + OCfc 

for a Aj<n; for otherwise we could increase 8 without disturbing (i). In 
case (a) 

n— 1 n n n-l n—l 

X Cty^judy =Sft= X X dy 

1111 1 

and so ft + ... = + which is (6) with A; = n— 1. Hence 

(h) is true in any case; and then (iii) follows from the definitions. 

Dr R. Rado, who communicated Theorems 74 and 75 to us, uses them 
to obtain a new and elegant proof of the sufficiency of Muirhead’s criterion 
(Theorem 45 ). The result is true for n = 2 by Lemma 1 of p. 47; let us 
then suppose that n > 2, that the conditions (2. 18. 1), (2. 18.2), and (2. 18.3) 
are satisfied, and that the result is true for any number of variables less 
than n. Then, by the inductive hypothesis, 

0^n^]S[ft>ft]> [ft» •••jft] [ft+l» = ••*>*»»]• 

Hence, using Theorem 74 twice, we obtain 

[a/, ..., OnT = [ai', ft, *nT = [ft » •••» ft] 

~[ft» •••» ft* i^fc+l» ft]^[*l* •••* ®ib* ••• *»] = [«!, 
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76. If a > 0, and r and a are positive integers, then 



unless all the a are equal. 

[Generalisation of Theorem 66. By Theorem 45, 

[r, r, ... (a times). 0, 0 0]>[j. ”, .... jJ. I 

Form the corresponding inequality with r and a interchanged and a 
replaced by 1/a, and multiply.] ' 


77. A necessary and sufficient condition that 

. . . p„an , 

where Pi , . . . are defined as in § 2.22, and the a are positive, for all positive 
a, is that 


for 1 ^m^n, with equality when m= 1. 


[The sufficiency follows from Theorem 51 ; the necessity may be proved 
on the lines of § 2. 1 9 ( 1 ). Dougall ( 1 ) gives a proof for integral a based on an 
identity. For certain special cases, such as 

(0 g #f < A < jit). 


see KritikoB (1).] 


PfA^+^l^+.•,^hfl^^P^ul^P/ll^ • • • Pja ^9 


78. The means [i, i, 0, 0, ..., 0] and [|, J, J, 0, ..., 0] are not comparable. 
[Exeunple of Theorem 45 and illustration of Theorem 48.] 

79. If a>0, and is the arithmetic mean of the /*th roots of the 
products of fi different a, then 

Pi>p,>...>p„, 

unless all the a are equal. 

[Smith (1, 440). Example of Theorem 45: 

[1,0,0 0]>[*,i,0,...,0]>ft,J,i,...,0]>....] 


80. If /Li ^ 0, and x, y, z are positive, then 

xf^(x-y) (x-z)+yf^(y-z) (y-x) + z^(z-x) {z-y)>Q, 
unless a; = 2/ = ^* 

81 . If V ^ 0, 8 > 0, and the a are positive and not all equed, then 
[v + 28, 0, 0, * 4 , . . .] — 2 [i/ + 8 , 8, 0, (X 4 , . . .] -f- [vp 8 , 8, 0 C 4 , . . .] > 0. 

[This result, communicated to us by Prof. I. Schur, is not a consequence 
of Theorem 45, but follows from Theorem 80, with fj. = v/8.] 
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MEAN VALUES WITH AN ARBITRARY 
FUNCTION AND THE THEORY OF 
CONVEX FUNCTIONS 

3.1 . Definitions . The means (o) and @ (o) are of the fonn 

(3.1.1) = 
where ^ (*) is one of the functions 

af, log* 

and (a:) the inverse function. It is natural to consider more 
general means of the type (3.1.1), formed with an arbitrary 
function <f) subject to appropriate conditions. The most obvious 
conditions to be imposed upon ^ are that it should be continuous 
and strictly monotonic, in which case it has an inverse which 
satisfies the same conditions. 

We require the following preliminary theorem. 

82. (») is continuovs and strictty monotonic inH^x^K, 

(ii) H<a,^K (v= 1,2, ...,w), 

(iii) q^>0, Sg,= l, 

then (I ) there is a unique SOI in {H, K)for which 

(3.1.2) ^(S0i) = Sg^(a), 

(2) SOI is greater than some and less than others of the a, unless 
the a are all equal. 

Since f>{x) is continuous and increases or decreases from ^(H) 
to tf>{K) when x increases from H to K, and £3<^(a) lies between 
these limits, there is just one SOI which satisfies (3.1.2). Also 
'Zq{<i>m)-<f>(a)} = (i 

and some terms must be positive and some negative, unless all 
are zero. Hence SOI— u is sometimes positive and sometimes 
negative, unless it is always zero. 

We have assumed 4>{x) continuous in the closed interval 
(ff, K). The argument is still valid if ^x) is continuous and 

3 


HI 
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strictly increasing in H<x<K, while oo when 

or ^ (a:) 4- 00 when x-^K, provided that we then interpret <f> (H) 

as meaning — oo, or <f>(K) as meaning + 00 , and 9Jl as being H 
when (a) = — 00 or jK^ when (a) = + oo. Here H may be 
— 00 or ^ may be + 00 ; a particularly important case is that 
in which jSl®* + 00 . In the definition which follows, and 

in all the discussion of the properties of later in this chapter, 

it is assumed that is strictly monotonic and is eitner con- 
tinuous in the closed interval or behaves in the manner just 
explained. 

We write® 

(3.1.3) = = = 

The weights g are arbitrary positive numbers whose sum is 1, 
and when we compare two means it is to be understood that the 
weights of the means are the same. For ^(a;)=a;, log a: and af, 
2Jl^ reduces to 91, @ and respectively. 

3.2. Equivalent means. The mean is determined when 
the function </> is given. We may ask whether the converse is 
true : if SOl^ = for all a and g, is ^ necessarily the same function 
as X? This question is answered by the theorem which follows. 

S3fi In order that 

(3.2.1) 2R0(a) = a)l^(a) 

for all a avd g, it is necessary and sufficient that 

(3.2.2) + 
where a avd j3 are constants and a4:0. 

In the argument which follows we assume 0 and x continuous 
in the closed interval (H, .ff). It is easy to see that it applies with 
trivial variations in the exceptional cases mentioned in § 3.1. We 
shall actually prove more than we have stated, viz. that (3.2.2) 
is a sufficient condition for (3.2.1) to be true for all a and g, and 
that it is a necessary condition for (3.2. 1 ) to be true for aU sets of 

» In this chapter we define directly, and deduce its propertiM from the 
definition. In Ch. VI (§§ 6.19-fi.22) we shall show how may be defined *axio- 
m&tically *, that is to say by prescription of its characteristic properties. 

^ Knopp (2), Jessen (2). 
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two variables and weights. Later (§ 3.7) we shall prove still more, 
viz. that (3.2.2) is necessary for the truth of (3.2.1) for all sets 
of two variables and a fixed prir of weights. 

(i) If (3.2.2) is satisfied 

X = ^2 X (®) = ^ (®) + P} 

= aSgi^(a) + j8 

and so 9)1^^ = . Hence the condition is sufficient 

(ii) In proving the condition necessary^ we assume only that 
(3.2.1) is true for all sets of two variables and weights. 

In (3.2.1) take 

rr r/' ^ ^ ^ ^ 

n=2, a^^H, az = K, = = 

where H<t<K. Then 


(3.2.3) 


=X~' 


K-t t-H 


for H<t<K\ and this is also true for t:=H and t^K. If we 
denote the common value by x then, as t varies from H to 
X assumes all values in (£f , K) and 






=^x^(x) + p, 


where a and p are independent of x. Hence 

for all X in {H, K); and this is (3.2.2). This completes the proof 
of Theorem 83. 

One corollary of Theorem 83 which is sometimes useful is this. 
Since — ^ is a linear function of <f>, and — ^ increases if <j> decreases, 
we may always suppose, if we please, that the ^ involved in 
{x) is an increasing function. 
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Theorem 83 also enables us to elucidate tho apparently exceptional 
status of SRo= ® among the means SH,. of Ch. II. Since 

1 * — I 

^(*)= ^ 

is a linear function of a?**, for r#0, we have, by Theorem 83, 

aK,(a)=9mji,(a). 

This equation is still valid forr=0, since <f>t(x) = logx. 

3.3. A characteristic property of the means S9L. It is 
natural to ask whether there is any simple property of the means 
of Oh. II which characterises them among the more Wneral 
means considered here. ' 

84 “ Suppose tJiat ^ (a:) is continuous in the open iniervdl (0, oo), 
and that 

(3.3.1) ^^(ka)^km^(a) 

for aM positive a, q, and k. Then W^(a) is SD?r(®)- oiAer words, 
the means 911, are the only homogeneous means 911^. 

Naturally (3.3.1) does not imply ^=a?‘ (or logx); for, by 
Theorem 83, we can replace ^ by <x^ + )3 without changing . 

That (3.3.1) is true when ^=af or ^=loga; is obvious. We 
now assume (3.3.1) and deduce the form of After Theorem 83, 
we may suppose that 

(3.3.2) ^(1) = 0; 
for we may replace f>{x) hy — 

We write (3.3.1) in the form 

aR^(o) = k-m^ m = m) = aK^(«). 

where ^{x) = <^{kx). 

It follows from Theorem 83 that 

(3.3.3) <f>{kx) = a.{k)<f>{x)-{- ^{k), 

where a.(k) and ^{k) are functions of k, and a(^:) + 0; and from 
(3.3.2) and (3.3.3) that 

(3.3.4) <f>{k) = p{k). 

* Nagumo (1), do Finetti (1), Jessen (4). The following simple version of de 
Finetti’s proof was communicated to us Dr Jessen. 

b If we used one of tho more preoire forms of Theorem 83, referred to after its 
enunciation in § 3.2, we should obtain a more precise form of Theorem 84, in whioh 
homogeneity was only assumed for restricted classes of variables or weights. 
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If we substitute from (3.3.4) into (3.3.3), and write y for Jk, we 
find that 

(3.3.6) ^(a^) = «(y)^(a!) + <^(y) 
for all positive z and }/. 

Similarly 

(3.3.6) ^(*y)=a(®)^(y)+<^(*); 
and (3.3.6) and (3.3.6) give 

a(a;)-l ^ tt(y) - 1 » 

^(z) ^{y) 

Each of these functions must reduce to a constant c, so that 
«(y)= 1 +c^i(y). It then follows from (3.3.6) that 

(3.3.7) ^xy)=c<t>{z)^(y)-^i(z)+if>{y). 

In discussing this functional equation, we must distinguish two 
cases. 

(1) If c= 0, (3.3.7) reduces to the classical equation 

<^(a^)=^(») + <^(y). 

The most general solution, continuous for a; > 0, is ^ = C log z. 

(2) Ifc + O, weput c^(x) + l—f(x), 
and the equation reduces to 

whose general solution is /= if. Hence 

8.4. Comparability. Our general remarks on the ‘com- 
parability’ of functions of the a (§1.6) suggest the following 
problem : given two fundione iji and x, eoch continuoaa and strictly 
monotonie in {H, K), are and comparable; i.e. is there an 
inequality 
(3.4.1) 

* Provided xd*!, Since (3.3.7) is plainly true when j; or y is 1, the ez> 
oeption is irrelevant, 
b Cauchy (1, 103-106). 
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(or one in the opposite sense) valid for aU a and qt Theorem 16 
tells us that the answer is affirmative when 0 and x powers. 

We write x (*)} = 0 (*)■ 

Then 0 is continuous and strictly monotonic, and has the inverse 
0~^=0X“^. We also write 

*=0(0), o=0-i(a;). 

Then the * are arbitrary numbers between and 0(^); and 
(3.4.1) takes the form 

(3.4.2) 0(Sga:)^2g0(a:) \ 

(for all g) if X is increasing, the reversed form if x is decreasing. 
We thus obtain 

86. 7/0 and x continuous and strictly monotonic, then a 
necessary and sufficient condition that and should he com- 
parable is that 0 = x0“^ should satisfy (3.4.2), or the reversed 
inequality. 

In what follows, we examine this class of functions 0 in detail. 
For arbitrary weights p, (3.4.2) becomes 

,3.4.3, 

3.5. Convex functions. The function 0 of §3.4 was the 
resultant of two monotonic functions, and therefore itself mono- 
tonic; but now we consider a 0 subject to (3.4.2) only. 

The simplest case of (3.4.2) is 

( 3 . 6 . 1 ) 

A function which satisfies (3.4.2) satiifies (3.6.1), but the class 
of functions satisfying (3.5.1) is more general. We shall however 
show that the two inequalities are equivalent for functions sub- 
ject to certain not very restrictive conditions. 

A function which satisfies (3.6.1) in a certain interval is called 
convex in that interval. If —0 is convex, 0 is concave. We may 
also define convexity or concavity in an open interval. It is 
often convenient to admit infinite values at the ends of the in- 
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terval; it is obvious that such values must be positive for convex 
and negative for concave functions, if the interval is finite. 

The foundations of the theory of convex functions are due to 
Jensen (2).'*' Greometrically, (3.5.1) means that the middle point 
of any chord of the curve y—^(x) lies above or on the curve; 
here a curve means any, not necessarily continuous, graph. The 
inequality 

(3.6.2) ^(?i*i+?2*a) 

(for all g) asserts that the whole chord lies above or on the curve, 
and the general inequality (3.4.2) asserts that the centre of 
gravity of any number of arbitrarily weighted points of the curve 
lies above or on the curve. It is geometrically intuitive that, 
when the curve is continuous, the weakest condition implies the 
stronger, and we shall find that much more than this is confirmed 
by our analysis. We might have taken (3.4.2) or (3.6.2) as our 
definition of convexity, but we have followed Jensen in starting 
from the weakest definition.! The most natural definitions are 
perhaps (3.6.2) and another which we discuss in § 3.19. There is 
some logical interest in assuming as little as possible. 

It is sometimes useful to have a definition of the convexity or con- 
cavity of a finite or enumerably infinite set of numbers. We shall say that 
the set Ox, ... ,a„ is convex if 

^ 1 4" (v = 2, 3, . . . , ti- — 1 ), 

i.e. if the second differences of the set are non-negative. 

Thus we can state Theorem 51, in the less exact form with ‘ g *, by 
saying that the set log j? is concave ; the full theorem is that logp is strictly 
concave (see § 3.8) unless the a are equal. When two products of powers 
of the p are comparable, the inequality which holds between them may 
be deduced (substantially as Theorem 52 was deduced from Theorem 51) 
from the concavity of log j?. This is the kernel of Theorem 77. 

3.6. Continuous convex functions. We now proceed to 
investigate the simplest case in which (3.4.2) and (3.6.1) are 
equivalent. 

If <f>(x) satisfies (3.6.1), we have 

4# (a±^s±^±s) s M ^ (a±*!) 

g ^ («i) + <l> (*a) + ^ (®8) + ^ (*4)» 

* Though H61der (1) had ooiuddered the inequality (3.4.2) before JenM. 
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and so on. We thus prove 

.^(*i) + ^(3?2)+ ••• +^(®n) 


(3.6.1) ^ j ^ ^ 


n 


for a particular sequence of values of n, viz. »= 2"*. 

To prove (3.6.1) true generally, it is enough to prove also 
that, if it is true for n, it is true for » — 1.» Suppose then that 
(3.6.1) has been proved for n numbers, and that 
are given. Taking to be the arithmetic mean iK (wuh equal 
weights) of the n — \ numbers, and applying (3.6.1), we qbtain 
((w— 1)91+91) j.9f\\ 


^(91)=^ 


n 




+X2+... + a:„_i 


n 


. ^ («l)+^2) + . . . + ^ (a^n-l) + 0(^) 


n 


and so 


^ (3t) ^ + — + ^ 


which is (3.6. 1 ) with n—1 for n. Hence (3.6. 1 ) is true generally. 

Next, supposing that, in (3.6.1), the a; form appropriate groups 
of equal numbers, we obtain (3.4.2) for any commensurable q. 

Finally, if i}>{x) is continuous, we can prove (3.4.2) without 
resiriction on the q\ for we may replace the q by commensurable 
approximations and proceed to the limit. We thus obtain 


86. Any continuous convex function satisffiea (3.4.2). 

As an application, we may consider Theorem 17.If«=J(r+<) 
we have, by Theorem 7, 

(Spo*)* ^ Spo’’Spa‘, 

or {2tt/(a)PsgR/(a)9Jl/(a). 

or log SR,* (a) ^ J {log SR/ (a) + log SR/ (a)}. 

In other words 


87. logSR/(a) = rlogSRr(a) is a convex function ofr. 

From this, by appealing to Theorem 86 (or repeating the 
argument by which this theorem was proved), we deduce Theorem 
17 (apart from the specification of the cases of equality). 

* Here we follow the lines of $ 2.6 (il). For s proof following Csuchy’s aigtunent 
mote directly, see Jensen (2). 
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8.7. An alternative definition^. We characterised a convex 
function ^ (x) in § 3.5 by the fact that the middle point of a chord 
of the curve y—<l>{x) lies above or on the curve. Biesz and Jessen 
have made an observation which is interesting and sometimes im- 
portant in applications viz. that, when (f> {x) is continuous, it is 
sufficient to require that some point of the chord should lie above 
or on the curve. 

88 . If <f> {x) is continuous, and there is at least one point of every 
chord of the curve j/ = ^ {x), besides the end points of the chord, which 
lies above or on the curve, then every point of every chord lies above 
or on the curve, so that <f> {x) is convex. 

Suppose that PQ is a chord, and R a point on the chord 
below the curve. Then there is a last point 8 on PR and a first 
point T on RQ in which the curve meets the chord: 8 may be P 
and T may be Q. The chord 8T lies entirely below the curve, 
contradicting the hypothesis. 

This remark gives us an alternative proof of Theorem 86. If 
<f> (a;) is convex, the middle point of any chord lies above or on the 
curve. Hence, as we have proved, every point of the chord lies 
above or on the curve. That is to say 

^ ’t' 5^2^2) = (^l) (72^ (^2) 

if ?i > 0, 22 > ?i + ?2 ~ 1 » ?2 otherwise arbitrary. 

We may then proceed by induction. If + ?2 + ffa ~ 1 > 

(3.7.1) = (3^2+ 

= q^<f> (Xj) + 32^ (**) + (^a)' 

and 80 generally. 

A corollary of Theorem 88 is 

89. If 4> (x) is cordinuoua, and every chord of y-^t {x) meets the 
curve in a point distinct from its end-points, then ^ (x) is linear. 

» M. Kieaz (1), Jesren (2). *’ See, for example, § 8.13. 
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By Theorem 88, every point of every chord lies on or above the 
carve. But Theorem 88 remains tine if ‘above’ is replaced by 
‘below’ in hypothesis and conclusion. Hence every chord of the 
curve coincides with the curve. 

From Theorem 89 we can deduce the refinement on Theorem 83 
referred to in § 3.2. Suppose that I 

{?! ^ {®i) + (®a)} ~ {ffiX (®i) + ?2X (®2)l 

for fixed and arbitrary a. Writing — 

a =0-1 (a:), we obtain \ 

^(7l»l + g2»2) = 7l^(»l)+?2^{*2). \ 

so that one point at any rate of every chord of y=(f>(x) lies on 
the curve. It follows from Theorem 89 that 0 is linear. 

3.8. Equality in the fundamental inequalities. We now 
suppose 0 (x) continuous and convex, and considerwhen equality 
can occur in (3.6.1), (3.6.2), or (3.4.2). 

Suppose that Xi<Xg<x^, that x^^qj^x^+q^x^, and that Pv 
Pi,... are the points on the ciurve y=0(a;) corresponding to 

x^,X 2 If 0 (x) is not linear in {x-^ , x^), there is an x^ in (x^ , x^) 

such that P4 lies below the line Pg . Suppose for example that 
Xt lies in (x^ , x^). Then x^ lies in (x^ , Xi), and Pj lies on or below 
P4P2, and therefore below P1P2. Hence (3.6.2) holds with 
inequality. It follows that eqvality can occur in (3.6.2) only when 
0 (a;) is linear in (xi,Xi). 

This conclusion is easily extended to the general inequality 
(3.4.2). Suppose, for example, that there is equality when »= 3, 
and that iCj < Xg < x, . Then all the signs of inequality in (3.7.1) 
must reduce to equality, and 0(x) must be linear in each of the 
intervals 


1*1 > 


(*2.*s) 


g2*2 + g 8 *a \ 
ga + gs / 
and therefore over (x^.x^). 

We have thus proved 

90. If 0 (x) is continuous and convex, then 

(3.8.1) 0(S?x)<S30(x), 

,Sp0(x) 


(3.8.2) 




Bp 
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unUsa eUher (i) all (he x are equal or (ii) ^ {x) is linear in an interval 
including all the x. 


91. Any chord of a continuous convex curve lies entirely above 
the curvCy except for its end-points, or coincides with iU 
We may say that <f>{x) is strictly cmvex if 

(3.8.3) + 

for every unequal pair x,y. Since a strictly convex function 
cannot be linear in any interval, any such function, if continuous, 
satisfies (3.8.1) and (3.8.2), unless all the x are equal. 


3.9. Restatements and extensions of Theorem S5fi We 
may restate Theorem 85 in the form 


92. If ifi and x continuoiis and strictly monotonic, and x 
increasing, then a necessary and sufficient condition that g 2)1^^ 
for all a and q is that <f) = shovM be convex'^. 

We shall say in these circumstances that x is convex mth respect 
to tft. Thus t* is convex with respect to V when « ^ r > 0. 

The curve y~<f{x) has the parametric representation 

x=4i{t), y=x(f)- 

The chord through the points on the curve corresponding to 
i=s^and^=^2is 

x=i{t), y = i>*{t). 


where 

is the function cofi {t) + j8 




x(y 


which assumes the values x(^i) xih) 
may call y^ift* {x) the ^-chord of 3/ = x (^) * order that x should 

be convex with respect to iff, it is necessary and sufficient that 
X S i*©* that every point of any ^-chord of x should lie on or 
above the curve. 


» Jessen (2, 3). 

** We have actually proved more in regard to the necessity of the condition: see 
our remarks on Theorem 83. 
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Theorem 92 may be generalised as follows. Suppose that 

is a function of m variables vg, and that 

is the result of taking means with respect to , 1 ^ 2 , • • * In succes- 
sion. \ 

93. Suppose that and Xfi continuous and strictly mono- 
UmiCy and that w increasing, Then^ in order that \ 

for all a and q,iti8 necessary and sufficient that every should be 

convex with respect to the corresponding tfi^ . 

It is understood, of course, that the weights involved in the 
operations 911^^ and are the same, though they will generally 
vary with p. That the conditions are sufficient follows at once 
from Theorem 92. To see that they are necessary, we have only 
to suppose a to be a function of a single 

8.10. Twice differentiable convex functions. We postpone 
to § 3.1 8 any further discussion of the general properties of convex 
functions, and consider now a particularly important sub-class 
of such functions, viz. those which possess a second differential 
coefficient. 

94. Suppose that <f) (x) possesses a second derivative (x) in the 
open interval (Jff, K). Then a necessary and sufficient condition that 
^ (x) should be convex in the interval is 

(3.10.1) ^"(a:)^0.» 

(i) The condition is necessary. Replacing ^(a: -I- y) and J(a;-y) 
by t and h in (3.6.1), and supposing that a; > y, so that A > 0, we 
obtain 

ft The important case in practice is that in which (as stated in the theorem) 
exists in the open interval. We usually wish, however, to assert convexity in the 
cfosed interval. Since and are monotonio near the ends of the interval 

and tend to finite or infinite limits; may tend to - qo at the left-hand end and to 
<f 00 at the right-hand end, and ^ may tend to + « at either. The function will be 
convex in the closed interval if its value at each end is not less than its limit at that 
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(3.10.2) <f>{t + h) + <f>{t—h) — 2<f>{t)'^0 

for all A such that the arguments lie in the interval. 

Suppose now that 4>" (t) < 0. Then there are positive numbers 
SandlMohthat _S. 

for 0 < « ^ A. Integrating this inequality from ii=0to«=A, we 
obtain ,^(<+A) + ^(<-A)-2^(()< 

in contradiction with (3.10.2). 


(ii) The condition is sufficient. We prove that ^ satisfies (3.4.2). 
In fact, if X = Ega:, we have 

4, (X,) = ^ (X) + (*, - Z) 4' (X ) + - X)* r (a 
for some i, between X and , and so 

Eg ^ (a:) S (X) = ^ (Hqx). 

If (a;) > 0, there can be equality only if every x is equal to X. 

We have therefore proved 


SS.® If 4>" (x) > 0, then <f> (x) is strictly convex and satisfies (3.8. 1) 
and (3.8.2), unless all the x are equal. 


3.11. Applications of the properties of twice differenti- 
able convex functions. Combining the proofs of Theorems 85 
and 95 (part (ii)) we obtain the following theorem, which will be 
found particularly useful in applications. 


96. Iff, and X Of 6 strictly monotonic, x increasing, 4> = x^ 
and 4>" > 0, then unless all the a are equal. 

Examples. (1) If^ = log a:, x =*, then = Theorem 96 

reduces to Theorem 9. 


(2) If ^ = a;', X = i»*, where 0 < r < a, then ^ =a^^ f > 0. Theorem 
96 gives Theorem 16 (for positive indices). The other cases of 
Theorem 16 may be derived similarly. 

(3) Suppose that f, = a^, where k is not 0 or 1 . Then <f> is convex 
in (0, 00 ) if A < 0 or A > 1, concave if 0 < A < 1. Supposing A > 1, and 


» Hdlder (1). 
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applying Theorem 95, we find 

S3»<(S5a*)»'* 

or Sp* < (Sp)^, 

unless all the a: are equal. If we write px^ab, pa^ = a* we obtain 
Theorem 13, for 1; > 1. The other oases follow similarly. 

(4) Suppose that ^=log(l + e®), so that 

and that the abscissae and weights in (3.8.1) are \og{a^la^), 
logibjb^}, and a, jff, .... We obtain 

+ d>2)^ (ij -f" b^^ , , , (?! + 

unless a^ja^ = * (Theorem 40 ; H for any number of sets 

of two numbers). 

(6) Suppose that ^ = (l4■a^^)^^ where r is not 0 or 1, and 
that the abscissae and weights in (3.8.2) are d^/d^, bjbi, ... and 
aj , , . . . . In this case ^ is convex if r > 1 and concave if r < 1 . We 

find, for example, that 

{(%-f" &!+ ... + liY + (^2^” ^2*1" ••• + 

< + ^20''" + (6/ + + . . . + (i^r ^ 

if f > 1 and djdi , bjb ^ , . . . are not all equal (M for any number of 
sets of two numbers). It will be remembered that both H and 
M can be extended to sets of more numbers inductively, 

(6) 97. Ifd>0,p>0,then 

Sp / Sp Spo /’ 

unless all the a are equal. 

We write this with p instead of q for the sake of s 3 nBimetry. 
The first inequality is (3.8.2) reversed, with ^ix)=\ogx, a 
concave function. It is equivalent to G (Theorem 9). The second 
inequality is (3.8.2), with <f>{x)=xlogx, a convex function. 

8.12. Convex functions of several variables . Suppose that 
D is a convex domain in the plane of {x,y), that is to say, a domain 



I 
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which contains the whole of the segment of a straight linn which 
connects any two of its points*. A function 0(a:,y) is said to be 
convex t» i) if it is defined everywhere in D and 

( 3 . 12 . 1 ) 

for all (Xi , i/i) and (X 2 , ^ 2 ) The definition asserts more than 
convexity in x and y separately; thus xy is a convex function 
of X for every y, and a convex function of y for every a?, but it 
is not a convex function of x and y. 

It is often convenient to use an alternative form of the defini- 
tion just given. Suppose that x, y, w, v are given, and consider 
the values of t (if any) for which (x-\-ut^ y + vt) belongs to D. 
Since D is convex, these values form an interval (which may be 
null). Then we say that O {x, y) is convex in D if 

(3.12.2) y+vO 

is, for every a?, y, u, v, a convex function of t throughout the 
interval of t in question. The definition is equivalent to that which 
we gave before, since, if 

x-huti^Xif x+ut2=^X2f y 'hvt2=y2» 

(3.12.1) becomes 

O is said to be concave if — O is convex. 

If z^O{x,y) is the equation of a surface in rectangular Car- 
tesian coordinates, (3.12.1) asserts that the middle point of any 
chord of thesurfacelies above oronthe corresponding point of the 

» It would be sufficient to consider rectangular domains, but convexity is the 
natural Umitation to impose on D, It is not part of our programme to consider 
questions of analysis situs connected with convex or general domains. 

*> There is a wider generaUsation of the notion of a convex function of a single 
variable which is important in the theory of functions but with which we shall 
not be concerned. The function <P{x, y) is subharmonic if its value at the centre of 
any circle does not exceed its average over the circumference. In particular « is 
subharmonio if it is twice differentiable and 

SS + ^yy ^ 0. 

For the theory of subharmonio functions see F. Riesz (6, 9), Montel (!)• 
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surface. If the surface is continuous we can deduce that the whole 
chord lies above or on the surface, and that the centre of gravity 
of any number of arbitrarily weighted points of the surface lies 
above or on the surface. This is what is asserted in the following 
theorem. 

98. 7/0 (Xy y) ia convex and continuovs, then 

(3.12.3) (Sga;, Y^qy) ^ SgO (a;, y). 

The proof is the same as that of Theorem 86, excdjpt for the 
obvious changes of notation. 

There is also a theorem corresponding to Theorem^SS; it is 
sufficient to assert that no chord of the surface lies (except for its 
end-points) entirely below the surface. All the other remarks of 
§3.7 remain true with the obvious changes. 

A theorem corresponding to Theorems 94 and 95 is 

99. 7/0 (a?, y) ia twice differentiable in an open D, then a necea- 
aary and aufficient condition that it should be convex in D is that the 

gmdraticform + 

should be positive^ for all u, v and all (x, y) of D. 

If Q is stricUy positive^, then (3.12.3) holds with inequality, 
unless all the X and all the y are equal. 

(1) The condition is necessary. If (x, y) is in D, then x(0. 
defined by (3. 12.2), is convex in a neighbourhood of f = 0. Hence, 
by Theorem 94, x” (®) = C = 0. 

(2) The condition is sufficient. If 

1^=1, X='Lqx, T = I,qy 

then 

Y) + {x,-X)<l>J> + (y,- Y)%o 

where the index 0 indicates the point (X, Y) and the index 1 some 
point on the line joining this point to {x„ . y„). It follows that 
SgO (a:,y) (JC, 7) =«I> (2ga:,Sjy). 

If Q is strictly positive, and there is equality, then x^^^X, 
y„— Y for all V. 


0 except for tt=:v=:0. 
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We notice that Q is positive if and only if 

(3.12.4) O^>0, 

and strictly positive if and only if 

(3.12.6) O^>0, 

It is negative if (3.12.4) holds with the signs of the first two 
inequalities reversed. 

The extension of the definitions and theorems of this section to 
functions of more than two variables may be left to the reader. 

3.18. Generalisations of Holder’s inequality. We may 
write Holder’s inequality in the form 

(3.13.1) 
or 

(3.13.2) 2go6 g F-^{LqF{a)} 0-^{ZqG(b)), 

where F {x) = 3f {r> \) and 0 {x) =xf, r' being as usual the index 
conjugate to r in the sense of § 2.8. If we write 
<f> = F-^, i/i=G-\ F(a) = x, G(b) = y, a = <j>(x), b==}l>{y), 
we obtain 

(3.13.3) :Lq<f>{x)ilt(y)^cf>(I^x)^(Eqy). 

The simplest case of this is 

i (aJi) ^ iVi) + i {Xi) 4> (l/a)} + «2)} >l> {i (Vi + ^ 2 )}. 

which expresses the fact that is a concave function of 

X and 3/. When, as here, and ^ are continuous, it is equivalent to 
the more general inequality (3.13.3). Hence, reversing the argu> 
ment (with general <f> and «/f), we obtain 

100. If F and 0 are continuovs and strictly monotonic, then a 
necessary and sufficient condition that %[ab) should be comparable 
mth ^F{a)3Slo(b) is that F'-^(x)0-^(y) should be a concave or 
convex function of the two variables x and y ; in the first case (3.13.1) 
is true, in the second the reverse inequality. 

As an example we may take F (x) — af,G {y) = It then follows 

from Theorems 100 and 99 that 

Sl(a6)^2R,(a)S««(6) 

if r > l,s> 1 and (r — 1) (6* — 1) ^ 1 • If r < 1 , < J , ( I — r) ( I — «) S 1, 
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the inequality is reversed. These are the only cases of com- 
parability*. The argument excludes the cases r=0 and s^O, 
but they may be included by using an exponential instead 
of a power. 

We might look for a more straightforward genera^ation of 
Holder’s inequality. Holder’s inequality asserts that (3.13.1) 
holds if /(a:) and g{x) are inverse positive powers of x, ^d either 

(o) !’(»)= */(*), 0{x)=xg{x), \ 

or \ 

( 6 ) F(x)=rfm, o{x)=\%mi ^ 

Jo Jo 

and we might expect that it would hold for other pairs of inverse 
/and g. The theorem which follows shows that no such extension 
is possible. 

101. Suppose thai f{x) is a continuous and strictly increasing 
function which vanishes for a; = 0 and has a second derivative con- 
tinuous for a;>0, and that g{x) is the inverse function {which has 
necessarily the same properties). Suppose further that F{x) and 
0{x) are defined either by (a) or by (6), and that (3.13.1) is true 
for all positive a, b. Thenf is a power of x and (3.13.1) is Holder's 
inequality. 

We consider case (a).^ If, as in the proof of Theorem 100, we 
write <f> and iff for and then ^ {x) 0 (y) must be a concave 
function of x and y. It follows from Theorem 99 and (3.12.4)® that 
g 0, tft'' S 0 and 

(3.13.4) {f(a:)f(y)}*^^(x)^(y)f'(*)f'(y) 

for all positive x and y. 

If ^ (x) « u, 0 (x) = V, we have 

x=J' (») = «/(«), ^ =/(«), 


and so 



X 

u 


0(x)=:t>, 


(3.13.6) 4>(x)^{x)=x. 

• CSompaie Theorem 44. ^ See Cooper (4) for ease (6). 

• With the appropriate ohangee of aigo. 
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Hence (x) tf/ (x) + 2<^' (x) i/f' {x) + <f> {x) 0" (x) = 0, 

and so, by (3.13.4), 

V + S 

all the arguments being x. This is possible only if 

V + ^ V' = 

or ^'0 is constant. Hence, by (3.13.5), X(l>'l<f) is constant, in which 
case ^ and the other functions are powers of x, 

3.14. Some theorems concerning monotonic functions. 

We collect here some simple theorems which will be useful later. 
The first characterisesmonotonicfunctionsas (3.4.2)characterises 
continuous convex functions. 

102. ®- A necessary and sufficient condition that 

(3.14.1) (y:p)(f,{iix)s'zp<f)(x), 

for all positive x and jp, is that <f>{x) should decrease {in the wide 
sense) for x>0. The opposite inequality is similarly characteristic 
of increasing functions. 

There is strict inequality if <f> (x) decreases strictly and there is 
more than one x, 

(i) If 0 decreases, (f}(Lx)^(f){x), whence (3.14.1) follows. 

(ii) If in (3.14.1) we take n = 2, x^ — x, X 2 = h, pi=s ly p^^^p, we 

obtain {l+p)<f>(x+h)^(f>{x)+p(f>{h). 

Making p-*-0, we see that <f>{x+h)£<f>{x). 

The case <f> (a;) = a:“-^ (0 < a < 1 ), i) = a:, gives Theorem 19. 

103. A sufficieni condition that 

(3.14.2) /(Sa:)gS/{a:), 

for all positive x, is that x-^f{x) shovld decrease. There is strict 
inegyudity if x~^f{x) decreases strictly and there is ware than one x. 

For if we vrn\/e f{x) — x<f>{x), then (3.14.2) becomes (3.14.1) 
with p^x. The condition is not necessary, since (3.14.2) is 
satisfied by any f{x) for which 

f{x) > 0, Maxf(x) ^ 2 Minf(x) ; 
for example f(x) = 3 + cos x. 

» Jensen (2): Jensen does not refer to the necessity of the condition. 
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104. If 

(3.14.3) ,l>(Lpx)^i:p^{x) 

for aU positive x and p, then ^(x)i8a muMple of x. 

If we take n=2, x^^x, x^^y.p^^yf^x and in (3.14.3), 
we obtain 

■ ^)<^) 
y X 

Since we can interchange x and y, is constant. 

3.15. Sums with an arbitrary function : generalisations 
of Jensen’s inequality. We may define 'sums’ involving an 
arbitrary function ^ as well as means. We write 

Here ^(x) is continuous and strictly monotonic, as in § 3.1 ; but 
it is necessary now to assume rather more, since E^(a) is not, 
like Sg^(a), a mean of the values of and so necessarily a 
possible value of ^(x). We therefore assume that ^{x) is positive 
for all positive x and tends to oo either when aj->0 or when 
x-^oo. We shall also assume that the a are all positive, leaving 
the reader to make the modifications appropriate when any a 
is zero*. 

// ^ arid x continuous^ positive, and strictly monotonic, 
then @0 and are comparable whenever (1)0 and x vary in opposite 
directions, or (2) 0 and x vary in the same direction and x/0 is 
monotonic. 

In case (1), 

(8.15.1) 

if 0 decreases and x increases. In case (2), (3.15.1) is true if x/0 
decreases. There is equality in case (1), and, if x/0 is strictly mono- 
tonic, also in case (2), only when there is only one a. 

• Suppose, for example, that where r>0 (the case of $2.10). Then 

^(0)»0, and we need make no distinction between two such systems of the a as 
(1, 1) and (1, 1, 0). If ^(0) were positive it would be necessary to distinguish, and 
the discrimination of the cases of equality in Theorem 105 would become te^ous. 
If ^(ar)- 4 -oo when a7-*-0, then whenever any a is aiero. 

^ The substance of this theorem is due to Cooper (2). 
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In case (1), when x increases, 

@x(®) ^ X~^{x(Maxo)}=Maxa 
and similarly (a) ^ Min a. 

In case (2), suppose that tft and x increase, and write 
0(a)=ar, o=0-i(a:), x0"^=/. 

Then (3.16.1) reduces to (3.14.2), and is true if a:~^/(a:) decreases, 

/{0(»)} _ X(a^) 

decreases. If tp and x decrease, (3.15.1) reduces to (3.14.2) 
reversed, and is true if fjx increases, or if xM decreases. 

The reader will have no difficulty in distinguishing the oases of 
equality. The case ^=a!*, x=a^ gives Theorem 19. 

We may also define weighted sums analogous to those of § 2. 10 (iv), viz. 

where the p are arbitrary positive numbers. reduces to SDl^ if 1, 
to if every p is 1. 

3.16. Generalisations of Minkowski’s inequality. If 

<l> {x) = af , where r > 1, we have 

(3.16.1) sro^^^jg^{aJi^(a) + 2R^(6)}, 

(3.16.2) ^ H®4(«) + ®4 W}* 

(3.16.3) s *{a;^(a)+a:^(6)}, 

all these inequalities being essentially equivalent and included 
in Theorem 24. 

The inequalities are not equivalent for general all of them 
are of the form 

(3.16.4) ^^“*{Sp^(a)} + ^~^{Sp^(6)}, 

but the differences between the weights p are now significant. 
In (3.16.1), Sp = l; in (3.16,2), ;p=l: in (3.16.3), the p are any 
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positive numbers. We call these three oases the cases (I), (11), 
and (III). In discussing them, we shall suppose that 


^ > 0, > 0 

for * > 0. 

The inequality (3.16.4) asserts that ^~^{Ep^(a:)} is, for given 
p, a convex function of the n variables x-^^, x^, ..., a;„; or, after 

§3.12, • that x(0 = <^-MSl>^(*+«<)}, \ 

where the x, p, u are fixed and the x and p positive, is convex in t 
for all t for which aU a: + are positive. If ^ is twice differel^tiable 
this condition is, by Theorem 94, equivalent to x" (0) ^ 0. A 
straightforward calculation shows that 


(3.16.5) 

{fWx" = {f(x)}*Sp«*f'(*)-f'(x){Sp«f (*)}*, 

where 


(3.16.6) X = = 

and have therefore to consider in what circum- 

stances 


(3. 16.7) (x)P ^puH" (X) - r (x) (*)}* ^ 0. 

It is easy to see that (3.16.7) cannot be true generally without 
restriction on the sign of Suppose for example that ^' > 0 and 
that is continuous and sometimes negative. We can then 
choose Xi and x, so that <l>" (x^) < 0, (x^) < 0, and % and so 

Pi«if (x^)+P2«2^'(«2) = 0• 

In this case (3.16.7), for n = 2, reduces to 

{f(x)}*blVf'(*l)+l’2«2V"(*2)}^0, 
which is false. We shall therefore suppose in what follows that 
^>0, tf>">0. 

We can write (3.16.7) in the form 


(3.16.8) 


0{X) = 


f«(x)^ {Sjmf(x)}« 

r (X)= ■ 


• W« take for granted the obvione exteneiona of § 3.12 from two to n Tariables. 
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Now, by Theorem 7, 

(3.16.9) 

Hence (3.16.8) is certainly true for all x, u if 

(3.16.10) 0(x)iSi,i|;M*=Sp^(*). 

for all X. Further, there is equality in (3.16.9) if 
<l>'(x) 

(v=l,2,...,n), 

so that (3.16.10) is both a sufficient and a necessary condition for 
the truth of (3.16.8). Finally, if we write y=<f>(x) and 

(3.16.11) <l)(y) = ^(a!) = ^{^-i(y)}=l|;/^g|!, 

(3.16.10) assumes the form 

(3.16.12) 0(S^)y)SS2)®(y). 

We now consider the three cases (I), (II), (III) separately. 

(i) Incase (I), (3.16. 12)is true if and only if<l>(y)isaooncave 
function of y. 

(ii) In case (II), (3.16.12) is (3.14.2), reversed and with y, <I> for 
x,f. A sufficient (though not a necessary) condition is thatO/y is 
an increasing function of y or, what is the same thing, that 

is a decreasing function of x. 

(iii) In case (III), (3.16.12) is (3.14.3), with the appropriate 

variations. It can be true generally only when O (y) is a multiple 
of y, or when is constant, in which case ^ is of one of the 

forms 

(3.16.13) (o®+6)« (o>0,c>l*), c“+*. 

In these oases it is true. 

There are alternative forms of the conditions (i) and (ii) which 
show better their relations to one another. We shall suppose 
continuous, as we may do without afCecting seriously the interest 

• Since ^">0. 
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of the results. Then from 


®(y)= 


& 

rix) 


we deduce 


®'(y) 


d 4>'{x) 
dx^"{x)^^ 




1 d® (a:) 
<f>' (x) dx^ <f>’' {x) ’ 


Hence 0(y) is concave if and only if ^'(x)f^''(x) is concaile, or, 
what is the same thing, if is increasing. These are 

alternative forms of (i), and an alternative form of (ii) is 9 /^' 
is convex’. 

Summing up our conclusions, we have 


106.“ Suppose that is continuous and that <f>>0, 

<!>" > 0 . Then 

(i) it is necessary and sufficient for (3.16.1) that 4>'l<f>'' should 

be concave, or increasing-, 

(ii) it is sufficient (btit not necessary) for (3.16.2) that should 

be convex, or decreasing-, 

(iii) it is necessary and sufficient for (3.16.3) that <f> should be 
one of the functions (3.16.13). 


We leave it for the reader to formulate the variations of this theorem, 
when (for example) ^ > 0, ^' < 0, > 0, or when the inequalities are re- 

versed. It is instructive to verify that (i) is satisded (from a certain x 
onwards) when )f>=x’’/logx, where p> 1, but not when ^ = x’’logx, while 

for (ii) the situation is reversed. 

* 

3.17. Comparison of sets. Theorem 105 asserts that 


e^(a)^©^(a) 

for a given pair of functions iff and x ^<1 o* The theorems 
of this section are of a different type, involving given sets (a) 
and (o') and a variable function tf>. We consider the conditions 
mider which ®^(.')S©*(a), 


» The first results of this character are due to Bosanquet (1): Bosanquet con> 
eiders case (11). 
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or, what is the same thing for increasing 

(3.17.1) 

for given a and a' and all ^ of a certain class. 

107 . Suppose that the sets (a) arid (a') are arranged in descending 
order of magnitude. Then a necessary and sufficient condition that 

(3.17.1) should be true for all continuous and increasing ^ is that 

ctf ^a^ (v= 1, 2, ...,n). 

The sufficiency of the condition is obvious. To prove it neces- 
sary, suppose that > a^ for some /i, that a^<b< a^\ and that 
<f>* (x) is defined by 

<f>*(x)=:0 (x^b), <^*(x)=:l (x>b). 

Then (a') > /Lt> 1 > (a). 

Hence (3.17.1) is false for <f>*, and therefore also false for an 
appropriately chosen continuous increasing approximation to (f>*. 

Ournexttheoremis connected with thetheorems of §§ 2.18-2.20. 

108 . In order that (3.17.1) should be true for all continuous 
convex <f>, it is necessary and sufficient either that (1) (a')-<(a), 
i.e. that (a') is majonised by (a) in the sense of § 2.18, or that (2) (a') 
is an average of (a) in the sense of § 2.20. 

If these conditions are satisfied, and (x) exists for all x, and is 
positive, then equality can occur in (3.17.1) only when the sets (a) 
and (a') are identical^. 

We have proved already (Theorem 46) that the two conditions 
are equivalent. It is therefore enough to prove that the first is 
necessary and the second sufficient. We may suppose (a) and (a') 
arranged in descending order. 

(i) Condition (1) necessary. Condition (1) asserts that 

(3.17.2) + 0^2^ “f* • • • "f" ■f" 0^2 + • • • 4" flty (y = 1, 2, ..., ?i), 
with equality in the case v — n. 

The functions x and —x are both continuous and convex in any 

» Schur (2) proves that (2) is a sufficient condition, and the remark oonoeming 
the case of equality is also due to him. For the complete theorem, see Hardy, 
Litilewood, and J^olya (2). Karamata (1) considers condition (1). 
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interval. Hence, if (3.17.1) is true, Sa' ^ So and S ( — o') g S(— o), 
i.e. So' — So, which is (3.17.2) for v=», with equality. 

Next, let 

<l>(x) = 0 (ar^o„), ^{x)=x—ay (a:>o,). 


Then ^ (x) is continuous and convex in any interval, and ^ (*) S 0, 
^ (*) ^ a: — o„ . Hence 

Oj "I* Og “f-...“|-0,, " VOy ^ S^ (o ) ^ S^ (o) — Oj^-|* Og "f" .. . 4* 0,1 — V(tipf 
which is (3.17.2). \ 

(ii) Condition (2) ia sufficient. If (o') is an average of (o), w^ have 


••• +J>Mn«n. 


\ 


where P^,v^0, S 3 J^„= 1 , Si)^,=l 

/**=! 

for all fi and v. If ^ is convex, then 

(3.17.3) <f>{a^,')^P^i <l>{ai) + ...+p^„ ^(a„), 

and (3.17.1) follows by summation. 


(iii) If there is equality in (3.17.1), there must be equality in 
each of (3.17.3). 

If <^"{x)>0, and every is positive, then it follows from 
Theorem 95 that all the o are equal, in which case all the o' are 
also equal and the common values are the same. 

In general, however, some of the^)^, will be zero. We shall say 
that o^' and o,, are immediately connected iip^p > 0, i.e. if o, occurs 
effectively in the formula for o^'; and that any two elements 
(whether o or o') are connected if they can be joined by a chain of 
elements in which each consecutive pair is immediately con- 
nected. 

Consider nowthe complete setC of elements connected witho^ . 
We may write this set (changing the numeration of the elements 
if necessary) as 

( C ) ® 1 »® 2 > ®1 >®2 > » 

the o' of C involve the o of C, and no other o, and no other o' 
involves an o of C. Hence, using the sum-properties of the p. 
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SO that 0 contains just as many a' as a. It follows from Theorem 
95, and from the equality in (3.17.3), that all o immediately 
connected with an o' are equal to that o'. Hence all connected 
o and o' are equal, and O contains r elements of each set, all 
equal to eq. 

We now repeat the argument, starting from o,+i, and we con- 
clude that both (o) and (o') consist of a certain number of blocks 
of equal elements, the values of the elements in corresponding 
blocks being the same. 

Incidentally we have proved 

109. If <f>" («) > 0, > 0, = 1, = 1, and o^' = Sp^„o„ , 

V V 

vh&lft 

(3.17.4) 

unless all the a and a' are equal. 

If all the a' are equal, (3.17.4) is a special case of Theorem 95. 

A special case of Theorem 108 which is often useful is 

110. If (f) {x) is continuous and convex^ and | A' | g | A | , then 

(3.17.6) (a; — A') 4- (a; + A') ^ (:r — A) + (x 4- A). 

8.18. Further general properties of convexfunctlons . We 
have assumed since § 3.6 that is continuous. We shall now 
discard this hypothesis and consider the direct consequences of 
(3.6.1). The general lesson of the theorems which foUow will 
be that a convex function is either very reqular or very irregular , 
and in particular that a convex function which is not ‘entirely 
irregular’ is necessarily continuous (so that the hypothesis of 
continuity is a good deal less restrictive than might have been 
expected). 

111 . Suppose that <f> (x) is convex in the open interval {H, K), and 
bounded above in some interval i interior to (J?, JC). Then ^(x) is 
continuous in the open interval (£?, K). Further ^ (f> (x) has everywhere 
left-hand and right-hand derivatives] the right-hand derivative is 
not less than the left-hand derivative] and both derivatives increase 
with X. 
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It follows that a disoontinuoiis convex function is unbounded 
in every interval. 

We prove first that <f>{x) is bounded above in every interval 
interior to {H, K). The kernel of the proof is this. The argument 
of §3.6 shows that 

for any regional q\ it was only in the passage to irrational ^ that 
we used the hypothesis of continuity. Suppose now that % is 
(A, h) and that the upper bound of ^ in f is O. It is enoi^h to 
prove ^ bounded above in (I, h) and (k, m), where I and m ar^ any 
numbers such that H <l<h<k<m<K, If x is in (Z, A), we can 
find a I in i so that x divides (I, ^ ) rationally, and then <f> (x) must 
lie below a bound depending on ^(Z) and and so be bounded 
above in (Z, A). Similarly, it must be bounded above in (A, m). 

To state the argument precisely, let A be the left-hand end of i 
and O the upper bound of ^ in i, and suppose that 

H <l<x<h. 

We can choose integers m and n>m%o that 

m 

lies in and then 

4“0+^#(i)SMax{Cr,#(!)}. 

Hence ^{x) is bounded above in (I, h). 

In proving the remainder of the theorem we may restrict our- 
selves to an interval {H', K') interior to (H, K), or, what is the 
same thing, we may suppose (f> bounded above in the original 
interval. Suppose then that <^{x)^0 in {H, K), that H<x<K, 
that m and »>m are positive integers, and that 8 is a number 
(positive or negative) small enough to leave x+»S inside {H, K). 
Then 

J/ ,{m{x+rS)-¥(n-m)x\ m, e\ . \ 

= i '—\i^<t,(x + n&) + —^4>{x). 
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or i>ix+n^)-<l>{x) ^ <f>{x+mS)-4,(z) 

n m '' ' 

Beplaoing 8 by —8, and combining the two inequalities, we 
obtain 

(3.18.1) 

n m 

nS) 

^ m ^ n 

(the central inequality following directly from the convexity of . 

If in (3.18.1) we take m=l, and remember that we 

find that 

(3.18.2) 

^ ^(a: + 8) - ^ ,^(x) 8) ^ 

We now suppose that 8 0 and n cx), but so that x±n^ remains 

inside the interval. It then follows from (3.18.2) that (a? + 8) and 
(a; — 8) tend to (f> (x), and so that ^ is continuous. 

We next suppose 8>0, and replace 8 in (3.18.1) by S/n. We 
have then 

(3 18 3) 

8^8^ 

^<l>(x)-<f>(x-y)^<l>{x)--(f>(x-8) 

« g? = g , 

where S' = mSjn is any rational multiple of 8 less than 8. Since ^ 
is continuous, (3.18.3) is true for any 8' <8. It follows that the 
quotients on the extreme left and right decrease and increase 
respectively when 8 decreases to zero, and so that each tends to a 
limit. Hence ^ possesses right-handed and left-handed derivatives 
and <f>/, and ^ 

Finally, we may write a; — S' s=a;i, a;=a:2» 

= a;-f S'=a? 3 ), when (3.18.3) gives 

(^s) ^ ^ (^») ^ ^ (^ 2 ) 4* i^i) 
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A fortiori, if «! < ajg < < aj*, we have 


if> {xf) — <l> {x^) ^ <f> (a^g) <f> (a?i) 

— X^^ 3^2 — 3/ j 

Making a:, -►a;*, a;2->a:i, we obtain 

(3.18.4) <!>; {xfj ^ ^ (x^) ^ (Xi), 

which completes the proof of the theorem. 

It is plain from what precedes that 


(3.18.6) 


x,-x. 


\ 


if aij g ajj < ajj g 0 : 4 . 

Theorem 111 asserts nothing about the existence of an ordinary 
difEerential coefficient <f>' (x). It is however easy to prove that <f>' (x) 
exists except perhaps for an enumerable set of values of x. The 
function ^/(*), being monotonic, is continuous except perhaps 
in such a set. If it is continuous at x^ then, by (3.18.4), <f>f{Xi) 
lies between <l>/(Xi) and ^/(a:«), which tends to ^,'{Xj) when 
x^-*Xl. Hence ^i,'(*i) = ^/(ai), and ^'{x) exists for x=Xj. 

It is also plain from (3.18.5) that, if <f>{x) is continuous and 
convex in an open interval (a, 6 ) , then 


\4>(x')-<l>{x)\ 
X’ —X 


is bounded for all x and x' of any closed sub-interval of (o, b) . 


3.19. Further properties of continuous convex functions . 
We now suppose <j) {x) convex and continuous. It follows from 
(8.18.6) that a H<^<K and 

then the line 

(3.19.1) y-4,{^) = \{x-^) 

will lie wholly under (on or below) the curve. In other words 

112 . If <f> (x) is convex and continuous then there is at least one 
line through every point of the curve y=^{x) which lies wholly under 
the curve. 
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A line through a point of a curve which lies wholly on one side of the 
curve (under or over) is called a Stiitzgerade of the curve. If ^ (a?) is con- 
cave, then the graph of ^ (a?) has at every point a Stiltzgerade over the curve. 
If is both convex and concave the two lines must coincide and (^(x) 
must be hnear. 

It is easy to see the truth of Theorem 112 directly. If i<x<x\ and 
P, Qf Q' are the points on the curve corresponding to f , x, x\ then PQ lies 
under PQ\ and the slope of PQ decreases as x approcbches f , and so tends 
to a limit v. Similarly, if x<$ and x tends to the slope of QP increases 
to a limit fi. If were greater than v, and Xi , a?* were respectively less than 
and greater than and sufficiently near to then P would lie above 
PjPg, in contradiction to the convexity of the curve. Hence ft^v, and 
(3. 19. 1) lies imder the curve if A has any value between fi and v inclusive. 

In this proof we do not appeal to Theorem 111, but the proof depends 
on just those geometrical ideas which underlie the more formal and 
analytical argument of § 3.18. 

Suppose now, conversely, that ^ (a?) is continuous and has the 
property asserted in Theorem 1 12. If and Xg are two values of x, 
Pi and Pg the corresponding points on the curve, and P the point 
corresponding to ^ — then both P^ and Pg lie over a 

certain line through P, and the middle point of P^ Pg lies over P. 
Hence ^(x) is convex. 

We have thus proved that the property of Theorem 1 12 affords 
a necessary and sufficient condition for the convexity of a con- 
tinuous function, and might be used as an alternative definition 
of convexity. That is to say, we might define convexity, for con- 
tinuous functions, as follows: a cmtinvmis function <f>(x) is said to 
be convex in (£f, K) if to any | of (//, K) corresponds a number 
swh that <l>{^) + X{x-$)^^(x) 
for all X of (H, K). 

This definition of a convex function is quit© as ‘natural’ as that implied 
in (3.5.2), and it is interesting to deduce some of the characteristic pro- 
perties of continuous convex functions directly from it. For example, the 
inequality (3.4.2) may be proved as follows*. 

Writing as usual ^(6) = Sg6, 

and t n Ving £ s ^ (a), a value which lies in the interval of variation of the a, 

» Jefisen (2). 
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for a certain A= A(f) and all a. Performing the operation on each Bide, 
w® obtain ^{9l(o)}+A{itt(a)-^}g9l{^(o)}, 

or sl{9l(o)}S9lW(a)}, 

which is (3.4.2). It is instructive to compare this curgument with that of 
§3.10(ii). 

i 

3.20. Discontinuous convex functions. Discontinuous con- 
vex functions are, by Theorem 111, unbounded in every interval, 
and their existence has not been proved except under the 
assumption of Zermelo’s Axiom or (what is for our purpos^ equi- 
valent) the assumption that the continuum can be well-orilered. 
If 

(3.20. 1 ) /(a; + y) =/(a;) +/(2/), 

then f(2x)==^2f(x) 

and 2f^^^^f(x+y)=f(x)+f{y). 

Thus a solution of (3.20.1) is certainly convex. 

It was proved by Hamel (1)^ that, if Zermelo’s Axiom is true, 
there exist bases [a, p, y, ...] for the real numbers, that is to say, 
sets of real numbers a, j8, y, . . . such that every real a is expressible 
uniquely in the form of a finite sum 

X = (2-oc -f- bp -f- • • • H" /A 

with rational coeflScients a^b, If we assume this, we can 
at once write down discontinuous solutions of (3.20.1); we give 
f{x) arbitrary values /(a), /()8), ... for a;=a, )S, ..., and define 
f(x) generally by 

f{x)^af{oL)-hbf(P)^... + lf(X). 

Then, ify=a'a-l-..., we have 

f{x + y)=f{{a + a')aL + ...} = (a+a')f{a) + ...^f[x)+f{y). 

For more detailed study of the properties of convex functions, of 
the solutions of the equation (3.20.1) and of inequalities associated with 
it, we may refer to Darboux (1), Fr4chet (1,2), F. Bernstein (1), Bern- 
stein and Doetsch (1), Blumberg (1), Sierpiiiski (1, 2), Cooper (3), and 
Ostrowski (1 ). Blumberg and Sierpiiiski prove that any convex meaaurable 
function is c(yrUimiom, and Ostrowski obtains a still more general result. 


* See also Haim (1, 581). 
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MISCELLANEOUS THEOREMS AND EXAMPLES 

118. If a is aoonstant, a%0, and x = a^> then 
<Sx(a)=S^(a), 

[The corresponding property of SJl is included in Theorem 83.] 

114. An increasing convex function of a convex function is convex. 

115. If every chord of a continuous curve contains a point which lies 
above the curve, then every point of every chord, except the end-points, 
lies above the curve. 

116. If ^(a;) is convex and continuous, a<b<c, and = ^(6) = ^(c), 
then <l>{x) is constant in (a, c). 

117. If all the numbers are positive, then 

»log?+ylog|>(a;+y)log^|, 

unless xla = ylb, 

[(xlogx)">0.] 

118. If f(x) is positive and twice differentiable, then a necessary and 
sufficient condition that log /(a?) should be convex is that 0. 

119. If (x) is continuous for a? > 0, and one of the functions x <j> (x) and 
ff>(llx) is convex, then so is the other. 

120. If <l>{x) is positive, twice differentiable and convex, then so are 

(5^1), 

(the first for positive x), 

121 . Ittjf and x continuous and strictly monotonic, and x increasing, 
then a necessary and sufficient condition that 

— (<h) — 

for all a and q, is that 

should be convex. 

[Compare Theorem 92.] 

122. Suppose that 

or (what is the same thing) 

1 Xi (l>(Xi) 

(il) 1 a?a i>{x2) ^0, 

1 «8 

for all Xi, Xf, Xf of an open interval I for which Xi<X 2 <x^, Then ^(a?) is 
continuous in /, and has finite left-handed and right-handed derivatives 
at every point of I. 



98 MEAN VALUES WITH AEBITBAEY FUNOTIOHS 


If ^(x) is twice differentiable, then (i) and (ii) are equivalent to the 
differential inequality 

[(i) and (ii) are alternative forms of (3.5.2), and ^(a;) is convex, so that 
the theorem is a restatement of parts of Theorems 111 and 94.] 

128. Suppose that j 

(i) ^(a?i) sin (x^ - a?j) + <fi(x^) sin (x^ - a?,) + ^(aJs) sin (a?, - x^) ^ 0,^ 
or (what is the same thing) 

cosrri mnxj^ <l>(^i) 

(ii) cosa^s sinar^ 4>{x^) |^0, 
cosa^s sina?3 ^(a^s) 

for all Xi^x^f a?j of an open interval I for which <a?,<a5j <a:i + 7 r. Then 
^(x) is continuous in I and has finite left-handed and right-handed 
derivatives at every point of 7, 

If ^(a;) is twice differentiable, then (i) and (ii) are equivalent to the 
differential inequality 

f'(a;)-h«^(a;)^0. 

[The result is important in the study of convex curves and of the 
behaviour of analytic functions in angular domains. See P61ya (1, 
320; 4, 673-676).] 

124. A necessary and sufficient condition that a continuous function 
^(x) should be convex in an interval I is that, if a is any real number and i 
any closed interval included in 7, then <j>(x)-\-cKX should attain its maximum 
in i at one of the ends of L If also x and <f>(x) are positive, then a necessary 
and sufficient condition that log^(a;) should be a convex function of logo; 
is that xf-if>(x) should have the same maximal property. 

[For applications of this theorem, which results immediately from the 
definitions, see Saks (1).] 

125. A necessary and sufficient condition that a continuous function 
^(x) should be convex in (a, 6) is that 

1 fx-k-n 

(i) ^(i*?)^2^ j ^ 

for a'^x^h<x<x + h^b, 

[This is a corollary of Theorem 124. If ^(a;) satisfies (i), so does if>{x) + oa?; 
and it is plain that any continuous function which satisfies (i) must possess 
the property of Theorem 124.» 

Theorem 125 may also be proved independently; and there are various 
generalisations. In particular we need only suppose (i) true for every x and 
arbitrarily small ^=^(a;).] 

* For a formal proof, use Theorem 183. 
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126. If 4^{x) IB convex and continuous for eJl x, and not constant, then 
^(a?) tends to indnity, for one or other approach of a; to infinity, in such a 
manner aa to be ultimately greater than a constant multiple of | a; | . 

127. If ^">0 for a:> 0, and ^(0) ^ 0, then ^/a? increases for a;> 0. 

[This follows at once from the equations 

128. If <^">0 for a? ^0 and 

lim {x<j>'—<l>)<0t 

X-Voo 

then <l>lx decreases (strictly) for a; > 0. 

[The limit certainly exists, since a;^' — ^ increases. The result follows 
from the equations used in proving Theorem 127. The cases considered 
in Theorems 127 and 128 are the extreme cases possible when ^">0; if 
neither condition is satisfied, <^/a; has a minimum for some positive a;.] 

129. If > 0 for all a?, (0) = 0, and is interpreted as 0' (0) for a; = 0, 

then <l>lx increases for all x. 

130. If the set Oj , Ua « 2 n+i is convex in the sense of § 3.6, i.e. if 

A*a,, = a,. — + (i/= 1,2, ...,2n — 1), 

then gi + a3 + -..+q 2 n+i > g 2 + gA+...+g 2 w 

n+1 “ n ' 


with inequality except when the numbers are in arithmetical progression. 
[Nanson (1). Add up the inequalities 

r (n— r+ 1) r{n — r) A^Ugr^O. 

Theorem 130 may also be proved as an example of Theorem 108; the set 
formed by the numbers 2, 4, ...» 2n, each taken n+1 times, is majorised 
by that formed by the numbers 1, 3, ..., 2n + 1, each taken n times.] 


131. If 0<a;< 1, then 


1 — a;’*^^ 1 — a;" , 

x/x. 

n+ 1 n ^ 


[Put x=y* and in Theorem 130.] 


132. C is the centre of a circle and AqA^ ... a polygon, whose 
vertices, except (7, lie on the circle in the order indicated. C, Aq, A„, are 
fixed and Aj, Ag, ...» A„^i vary. Then the area and perimeter of the 
polygon are greatest when 

AqAj= A|A2— ••• ~ 


[Let the angle CA^ be Since (sin a;)" < 0 for 0 < a? < tt, we have, 
by Theorem 95, ^ v 


7-3 
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imless all the a.; are equal, and a similar inequality in which is replaced 
by These inequalities give the two parts of the theorem. When 
coincides with they reduce to familiar maximal properties of regular 
polygons.] 

183. Suppose that / and g are continuous increasing inverse functions 
which vanish at the origin, that F=ixft Q^xg^ and that g satisfies the 

“equality g(^)^g{x)g(y). 1 

Suppose further that ^ab^AB for all positive a and h such that 
ZQ(h)^Q(B). Then \ 

[Cooper (3). The result is included in Theorem 15 when /is a pWer 
of X,] 

184. If <l> (x) is convex and continuous for a; ^ 0, i/ = 1 , 2, 3, . . . , and the 
are non-negative and decreasing, then 

^(0) + 2 (mi J - [(n ~ 1 ) a J} ^ ^ (i:a J. 

If also 4>*(x) is a strictly increasing function, there is equality only when 
the are equal up to a certain point and then zero. 

[Hardy, Littlewood, and P61ya (2). Write 

fio = 0, «„ = ai + Ot+...+o,. (v^l), 

and + l)a,.=s„.i + va,, = 2a;, 

l)Ov = 2A, — va^=2/i'. 

It is easily verified that \h'\^h, with equality only if a,. = 0 or 

Oj ~ CI2 ~ ~ • 

It follows from Theorem 1 10 that 

<f> (vay) ■“ ^ {(V — 1) Gy} ^ <l> (Sy) — ^ (^,,-. 1 ), 

and the result follows by summation.] 

185. If g> 1 and decreases, then 

S{»/«-(v~l)«) 

[Example of Theorem 134.] 

186. Suppose that a is a function of vi , 1 / 2 » • • • > ; that i^, . . . , v is 

a permutation of the numbers 1, 2, ..., m; and that the ^ and x a're con- 
tinuous and strictly monotonic and the x increasing. Then, in order that 

sk;- . . . (a) ^ (a) 

for all a and it is necessary that 

(1) Xm is convex with respect to , for /i = 1, 2, . . ., m; 

(2) Xa i® convex with respect to , if A > /x and A and fi correspond to an 
inversion in the permutation ...» (i.e. if the order is . .. /li, . . ., A, . .. 
in the series 1, 2, ...» mbut ...A, ...» /x, ... inij, ij, ..., im)» 

[Jessen (3).] 
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137. In order that 

for all a and q, it is necessary and sufficient that ( 1) ^ and (2) r^< s,, 

(the range of fi and A being defined as in Theorem 136). 

[Jessen (3). The most important cases are: 

(i) w= 1, r<s. 

(ii) w=2, (ii,i2) = (2, 1), = 

The two cases correspond to Theorems 1 6 and 26 . The kernel of the theorem 
is contained in the statement that, whenever the two sides of the in- 
equality are comparable, the inequality may be proved by repeated 
application of the special cases corresponding to (i) and (ii).] 

188. A continuous curve y = <^(a;) defined in an open interval, say 
0 <a 7 < 1, has the following property: through every point of the curve 
there is either (a) a line which lies under the curve, or (6) a line which lies 
over the curve. Then one and the same of (a) and (b) is true at all points 
of the curve, and the curve is convex or concave. 

[It is easy to show that if and Si, are the sets of values of x for which 
(a) or (6) is true, then and Sf, are closed (in the open interval). But a 
continuum cannot be the sum of two non-null closed and exclusive sets.] 

189. Suppose that ^ (a:) is convex and bounded belowin (ff,K),ajid that 
m(x) is the lower bound of ^(x) at x (the limit of the lower bound of ^(x) 
in an interval including x). Then m(x) is a continuous convex function; 
and either (i) <f>(x) is identical with m(x), or (ii) the graph of </>(x) is dense 
in the region H ^ a? ^ if, y ^ m (a:). 

If ^(x) is convex and not bounded below, its graph is dense in the strip 
H^x^K. 

[Bernstein and Doetsch (1).] 



CHAPTER IV 

VARIOUS APPLICATIONS OP THE CALCULUS 


4.1. Introduction. Particulax inequalities arising in ordinary 
analysis are often proved more easily by some special devi^ than 
by an appeal to any general theory. We therefore interrupt our 
systematio treatment of the subject at this point, and devote a 
short chapter to the illustration of the simplest and most luefol 
of these devices. The subject-matter is arranged according to the 
methods and instruments used rather than the character of the 
results. 


DIFFBEBNTIAL CALCTJLTTS: FTTNOTIONS OF ONB ; 
VARIABLE 

4.2. Applications of the mean value theorem. Our first 
ATampIfH depend upon a straightforward use of the mean value 
theorem 

(4.2.1) f{x+h)-f{x) = hf'{x+6h) (0<d<l), 

or its generalisations with higher derivatives. It is a corollary of 
(4.2.1) that a function with a positive differential coefficient 
increases with x. 


(1) We have 

log(a:-H)-loga!=|, 

where a: < | < a: + 1, when a; > 0. It follows that 

^ [a: {log (x+ 1) -log a:}] = log (a: -I- l)-loga!-^ > 0, 

^[(»+ l){log(a;-f l)-loga:}]=log(a!+ l)-loga:-^<0. 


Hence 


^l-f^j increases with x, while decreases. 
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Since the latter function is jl— —j , where 1>1, we 

V4. • \ y/ 

obtain 

140. increases foi z>0; decreases for z>l. 

This is substantially the same as Theorem 35. 

(2) If X > 1, r > 1, we have 


af = l+r(x-l) + |r(r-l)f-a(x-l)*, 
where 1 < | < x, and so 

141. x'>l+r(x-l) + |r(r-l)|^^j* {*>1, r>l). 

This inequality was foimd, in a less precise form, in § 2.16. 

(3) Ifx+ Owe have 
(4.2.2) e® = l+x+ix*e®®, 

where 0 < 0 < 1, and so 


142. c®>l+x (x+0). 

We can deduce another proof of Theorem 9. If 

S?=l, 2ga = 2(, 

and the a are not all equal, we can write a— (l+x)9t, where 
Hqz = 0 and the x are not all zero. Then 1 + x g e®, with inequality 
for at least one x, and 

no«=3tn(l+x)«<9te^®=9l=Sja. 

The argument is a special case of that used at the end of § 3.19. 

(4) The function /(x) = e® — 1 — x — |x*, 
and its first two derivatives, vanish for x=!0. There is no other 
zero off (x), since this would (by repeated application of Rolle’s 
Theorem) involve the existence of a zero off" (x) = e®. Hence 
c?*>l+x+Jx* (x>0), e»<l4-a:+Jx* (x<0). 

The same argument may be applied to any number of terms of 
the Taylor series of various functions. When the function is e®, 
we obtain 
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148. If n is odd then 

(4.2.3) e*>l + a;+|^+... + ~ (*+0). 

Ifnia even then (4.2.3) true for x>0 and the reversed inequality 
for X < 0. I 

4.3. Further applications of elementary differential 
calculus. In this section we give some applications of aUess 
immediate character. \ 

(1) Repeated application of Rolle’s Theorem leads easily to 
the following lemma*: if 

/(*. y) = Co*"* + Cix^-^y +...+c„y^=0 
has aU its roots xjy real, then the same is true of all non-identical^ 
equations obtained from it by partial differentiations with respect 
to X and y. Further, if E is any one of these equations, and has a 
multiple root a, then a is also a root, of multiplicity one higher, of 
the equate from which E is derived by differentiation. 

We use this lemma to prove a theorem proved already in a 
less complete form in § 2.22. 

144.® Ifai,a are n real, positive or negative, numbers, 

Pq— 1, andp^ is the arithmetic mean of the products offx different a, 

(/*= 1 . 2 ,. ..,»-!), 

unless all the a are equal. 

We suppose that no a is zero, since the specification of the 
cases of ecpiality becomes more troublesome when zero a’s are 
admitted. 

Let /(a:, y) = (» + Oiy) (a: + Ojy ) . . • (» + o„y) 

=Po»’* + (2) ••• 

Since no a vanishes, p„4=0 and x/y = 0 is not a root of /=0. 
Hence «/y=0 cannot be a multiple root of any of the derived 

• Maclatirin (2). See P61ya and Szegd (1» 46-47 and 230-232). 

^ That is to say, all equations whose coefficients are not all zero, 
c Newton (1, 173). For further references, see § 2.22. 
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equations ; and therefore no two consecutive p, such as and 
Pn+i> can vanish. Hence the equation 

+ip^xy+ p^+iV^ = 0 , 

which is obtained from / («, y ) = 0 by a series of differentiations, 
is not identical ; and therefore its roots are real, so that 

Finally, the roots of the derived equation can be equal only if 
all the roots of the original equation are equal. 

It will be observed that the a need not necessarily be positive, 
as they were in § 2.22.“ 

(2) Suppose that ^(a:)=log(Spo®), and (what is no real 
limitation) that the a are all positive and imequal. Then 

,, Spa* log a Spa®Spa*(loga)*— (Spa*loga)® 

^=“25^ ’ ^ = (S^ 

(Theorem 7). An easy calculation shows that, if o, is the 
greatest a, then 

<f> (0) = log Sp, lim - logp, . 

a ; ->>00 

It follows from Theorems 127 and 129 that (fijx increases for as > 0 
if Sp g 1, and for all a; if Sp= 1. In the last case 

^=logiDl,(a), lim^=f(0)=log@(a). 

^ a ;->0 ^ 

We thus obtain further proofs of Theorems 9 and 16. 

If, on the other hand, p„ ^ 1 for every v, <f>lx decreases, by 
Theorem 128. In particular ©j,(o) = (Sa*)*^* decreases (Theo- 
rem 19). The general case gives part of Theorem 23. 

(3) The following examples have applications in ballistics. 

146. log sec a; <J sin a: tan a; (0<a!<Jn). 

146. The function Slocrseca; 

where g(x)= {l+aeet)dt==z + log{Beex+taxix), 

decrecisea steadily from \ to 0 as x increases from 0 to 
j^Use Theorem 145 to show that ^{g^p' cot x) < 0 and so p' < O.J 
• Except in Theorem 55, For positive a, see Theorems 61, 64, 77, and § 3.6. 
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147. The function 


<r(x) = 


f (l+sec<)logsec< 

JO 

log seo xf (1 + sec t) dt 
Jo 


increaaea steadily from \to\a8x increases from 0 to 
There is a general theorem which will be found useful in the 
Theorem 147. 

148. I//, gr, andf'lg' are positive increasing functions, then ffg 
increases for all x in question, or decreases for all such x, or decrease 
minimum and then increases. In particular, t/ /(0)=gr(0) = 0i 
increases for x>0. 

To prove this, observe that 



^ V^/ \^' gj g 

and consider the possible intersections of the curves y^fjg, y—flg'» At 
one of these intersections the first curve has a horizontal and the second 
a rising tangent, and therefore there can be at most one intersection. 

If we take g as the independent variable, write f{x) = <l>{g), and suppose, 
as in the last clause of the theorem, that 


/(0) = g(0) = 0, 

or^(0) = 0, then the theorem takes the form: if if>{0) = 0and (g) increases 
for g>0, then <l>lg increases for g>0. This is a slight generalisation of part 
of Theorem 1 27. Theorem 1 48 should also be compared with Theorems 1 28 
and 129. 


4.4. Maxima and minima of functions of one variable. 

A very common method for the proof of inequalities is that of 
finding the absolute maximum or minimum of a function ^ [x) by 
a discussion of the sign of {x), 

(1) Since ^{(1— a:)e®}=s — 

the function (1— a;) has just one maximum, fora:=0. Hence 

149, (a;< 1, oj + O). 

This is also a consequence of Theorem 142. 

(2) Since ^(loga:-a:+l)=.i-l 

the function log a;— «+ 1 has one maximum, for X"* 1. Hence 

150. loga:<a!— 1 (*>0, *+1). 
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More generally log a: < n. (ar^n _ for any positive », since we 
may write a^“ for x in Theorem 150. This result is also a 
ooroUary of Theorem 36. 

(3) Let ^(a;) = l + a^-(l + a:*)i/fc(i+y*')W 

where &> 1, a!>0, y>0. It is easily verified that ^(a;) has a 
unique maximum 0 for a^ = y’^. 

This gives another proof of Hq (Theorem 38), and so of H 
(Theorem 11). 

(4) If X and y are positive, and !;> 1, then the function 



k' 


has the derivative y — a^'*', and attains its maximum 0 for = y*'. 
We deduce Theorem 61 (and so Theorems 37 and 9). 


(6) The function 

<f>(x) = xy—x log z — e^-^, 

where x is positive, attains its maximum 0 for a;— ev~^. We deduce 
Theorem 63. 


4.5. Use of Taylor’s series. — Sa„x’‘and g (x) = S6„a;“ 
are two series with positive coefficients, and a„ g b„ for every n, 
we say that f{x) is majorised by g {z), and write /-< g. It is plain 
that/^ g and/i-< g^ imply , and so on. 

To illustrate the use of this idea in the proof of inequalities, we 


prove 

151. //«„=ai+ 02 +... + a„,ti>Acrc«>l avd the a are -positive, 
(hen a „ 

(l + ai)(l+Oa)...{l+On)<l + f^+|j+-+^- 

In fact 1 + aya;-<e“»®, so that 

n (l+aa:)-<e«n®. 

The result follows by adding up the coefficients of 1, *, a:®, . . ., ®", 
and observing that there is strict inequality between the co- 
efficients of a^. It may also be proved by writing the left-hand 
side in the form 

, n(n—l) 

l+np^+ ?> 2 +-+ 3 >» 


(so that and using Theorem 52. 
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DIFFERENTIAL CALCULUS: FUNCTIONS OF 
SEVERAL VARIABLES 


4.6. Applications of the theory of maxima and minima of 
functions of several variables. The most ‘universar weapon 
for the discovery and proof of inequalities is the general tneory 
of maxima and minima of functions of any number of varialbles. 
Suppose that we wish to prove that two functions ^ and if/ of the 
continuous variables are comparable; let us say, to 

fix ourideas, that — This will be so, if ^ — 0 has a minimu^m, 

if and only if this minimum is non-negative ; and this is a question 
which can always be attacked (at any rate when the functions 
are differentiable) by the standard arguments of the theory of 
maxima and minima. 

The method is attractive theoretically, and always opens a 
first line of attack on the problem; but it is apt to lead to serious 
complications in detail (usually connected with the ‘boundary 
values’ of the variables), and it will be found that, however 
suggestive it may be, it rarely leads to the simplest solution. We 
illustrate these remarks by considering its application to the 
fundamental inequalities G and H. 

(1) To prove G, consider 

/ (a^i , a;2 , . . . , x^^{j = x^^i x^^ . . . x^^ , 

where ~ 


in the closed and bounded domain Xi>0, x^^O. It is con- 

tinuous, and therefore attains a maximum, which is not on the 
boundary (where/ vanishes). At the maximum 


{v=l,2,...,7l~l), 


/ dx^ X^ Xn Jn 
and so the x are all equal to Stt. In this case no serious complication 
is introduced by the boundary values*^. 

(2) We may use H (for two sets of variables) as an illustration 
of the ‘method of Lagrange’. Consider 

f (^1 ) ^*^2 > • • • » ~ • • • ■1' > 

• Compare § 2.6 (i). 
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where b, > 0, subject to the condition that 

*2,..., a;„) = a;i*'+a:a*' + (k>l) 

is a positive constant X. The (»— Ij-dimensional domain defi n ed 
by ® ^ 0, (^ = X is closed and bounded, and some x vanishes at 
every point of its boundary. 


If the maximum is attained at an internal point, then, at that 
point, 


^ kxj‘-^ 


= A 


Xv 


is independent of v; and an elementary calculation shows that 
/= = (Ex^)yk (Sfefc')W. 

There remains the possibility that the maximum should be 
attained at a boundary point, where some x, say , is zero. This 
possibility may be excluded by an inductive argument, since, if 
we assume that the inequality has been proved already for n -- 1 
variables, and that x^ — O, we have 

n-l /n~l \yk /»-l 

/= S b„x, ^ ^ ^ < ( W)"* 

The weakness of the method is that, if we are to argue by induc- 
tion at all, it is better to prove the whole theorem inductively, 
and then we come back to one of the proofs of H given already. 

(3) It is quite usual that the method should, as in case (2), 
prove troublesome when developed in detail; but even in such 
cases it is very useful as indicating a possible solution of the 
problem. 

A great many of our theorems assert inequalities between two 
symmetric functions /(Xi,a; 2 > homo- 

geneous of the same degree and positive for aU positive x. This 
is true, for example, of Theorems 9, 16 and 17 (for unit weights, 
the crucial case). Theorem 45 (in the case of comparability) and 
Theorems 51 and 52. 

When we use Lagrange’s method, we must consider the maxi- 
mum of / for constant g, say for 1. Lagrange’s equations are 

3 / ^ ^9 
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These equations have always a solution with = 

and A is the value of / for this system of values of x. If A is a 
Btiiot absolute maximum of f, then A$r, and there is inequality 
except when edl the x are equal. 


All this is in fact true in the cases mentioned, but th^re are 
other oases in which the solution does not give the maJ^um 
of /. This happens for example in Theorem 45, in the olase of 
non-comparability. \ 


INTBGEAI. OALOTTLtrS 


4.7. Comparison of series and integrals. There are many 
inequalities which are proved most easily by arguments based on 
the integral calculus; and often, by consideration of areas or 
volumes, in an ‘intuitive’ way. We give here a few of the most 
useful general theorems, in which the integrals considered are of 
the elementary Biemann or Riemann-Stieltjes type. In Chi VI 
we shall consider inequalities between integrals systematically, 
and there we shall use the general Lebesgue and Lebesgue- 
Stieltjes integrals. 

The theorems which foUow immediately are due in principle 
to Maclaurin and Cauchy*. 

162. Iff (x) decreasea for x ^ 0, then 

f(l)+f(2)+...+f(n)i jj{x)dx^f{0)+f(l)+...+f(n-l). 
There is inequality if f{x) decreases strictly. 

In fact /(v+l)gj f(x)dx$f{v) 

(with inequality if/(x) decreases strictly). 

Further theorems of the same t 3 ^, which we state without 
proof, are: 

168. Ifuo < 0 i< 02 < ..., and f(x) decreases for x'^a^, (hen 

S f ”*/(*) da:g S 

J OO *'*•1 

» Maolaurin (1, i, 289); Cauchy (2, 222). 
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164. If f{x)'^0, andf(x) decreases in (0, f) and> increases in 
(f, 1), where 0 g f S 1, th^ 


1 

n 


•^(s)+-^(i)+-+-^(V)} S J/W*' 


155, If f[x^y) is a decreastTig function of x for fixed y and a 
decreasing function of y for fixed Xy then 

m n rm rn n-ln-l 

s f{x,y)dxdy^ S S 

Jo Jo p.«0 l^—O 

Applications of these theorems, particularly to the theory of 
the convergence of series, may be foimd in any textbook of 
analysis. 


4.8. An inequality of Young. The simple but useful theorem 
which follows is due to W. H. Youngs and is of a different type, 

156. Suppose that </>(0) = 0, and that (f>(x) is continuous and 
strictly increasing for a; ^ 0; that iff {x) is the inverse function, so that 
iff (x) satisfies the same conditions ; and that a ^ 0, 6 ^ 0. Then 

ra rh 

ab^ 1 <f>{x)dx+ I tfi{x)dx. 

Jo Jo 

There is equality only ifb = ^{a). 

The theorem becomes intuitive if we draw the curve 
OTX=ij»{y), and the lines x=0,x=a,y=0,y=b, and consider the 
various areas boimded by them. A formal proof is included in that 
of the more general theorems which follow. 

A corollary of Theorem 156 is 

167. If the conditions of Theorem 156 are satisfied, then 
ab^a<f>{a)+bil>(b). 

Theorem 157 is weaker than Theorem 156, but often aseffeotive 
in applications. 

We pass to more general theorems which include Theorem 156. 

• W.H.Yotuig(3). 
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158. Suppose that 1, 2, n; that a^^O; (hatfy{x) ie con- 
tinwma, non-negative, and atricUy increasing', and tiiat one of the 
/„(0) is zero. Then 


n/,(a,)gS 


J 0 


fM).dfy{xy, 


andthereiseqtudityotdyifai = a^=...=a„.^ 

The inequality may be made intuitive by considering the curve 
x„ =f„ {t) in n-dimensional space, and the volumes bounded by the 
coordinate planes and the cylinders which project the curv)^ on 
these planes. 

To obtain a formal proof, put 


\ 


(O^argoJ, F,(x)=fy{a„) {x^a„). 


so that F,(x) ^f„(x). If we suppose, as we may, that is the 
largest of the a, , then, since II (0) = 0, we have 

ron rttn 

n/,(a,)=nj’,(aj= d{ni’,(*))=s n F^{x).dF,(x) 

Jo I' J 0 

rav rat, 

= 2 IlF^{x).dfAx)^i:\ Uf^{x).dfAx)i 

V J 0 vjo 

and there is inequality unless every a„ is equal to a „ . 


159.>> Suppose that g^ (x) is a system of n continuotis and strictly 
increasing functions each of which vanishes /or a: = 0 ; that 


(4.8.1) (*) = *; 

and that a, ^ 0. Then 

Jo X 

There is inequality unless g^ (Oj) — (Oj) = . . . = gr„ («„). 

If we put 

fl'rM®) = Xv(«). b„^gAaf), av=9'."MM = Xv(M 
and apply Theorem 158 to the system /^(a:) = Xv{x), we obtain 


• Oppenheim (1). The proof is by T. G. Cowling, 
b C^per (1). 
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A system of n functions 

connected by (4.8.1) is a generalisation of a pair of inverse 
functions. For suppose that n = 2, gi(x)=tx<f}{x), g^(x)-x\lf{x), 
and write (4.8.1) in the two forms 


X 


Then 










^ (•■») = {9i ix)}> ^ (a:) = = gr^ {Ut (a;)]. 


and g^^gi, gC^O^ inverse; and the functions ^ and 0 of 
Theorem 156 can always be represented in this form. Hence 
Theorem 159 includes Theorem 156. 

If in Theorem 159 we take (7„(a;) = a?^«F, where then 

(4.8.1) is satisfied, and we obtain 

which is Theorem 9. If in Theorem 156 we take ^ (x) = where 

k> 1, we have ^ (a?) = re* and we obtain Theorem 61. If we take 

^{a;) = log(a:+ 1), j/r(y) = e*'- 1, 

and write v for a-i- 1 and 6+1, we obtain Theorem 63, for 

We pointed out above that Theorem 156 is intuitive from 
simple graphical considerations. If instead of reckoning areas we 
count up the number of lattice-points (points with integral 
coordinates) inside them, we obtain 

160. If (l>{x) increases strictly with x, <^(0)==0. and ^{x) is the 
function inverse to (f) (x), then 

m 

mn^Ii[<l> (ft)] -fS (i/)], 

0 0 

where [y] is the integral part of y. 

This theorem is less interesting in itself, but illustrates a type 
of argument often effective in the Theory of Numbers. 

• Actually the lesult is true for tt>0 and all v, bee § 4.4 (5). 



CHAPTER V 

INFINITE SERIES 


6.1. Introduction. Our theorems so far have related to finite 
sums, and we have now to consider their extensions to infinite 
series. The general conclusion will be that our theorems r^ain 
valid for infinite series in so far as they retain si^gnificance. \ 

Two preliminary remarks are necessary. \ 

(1) The first concerns the interpretation of our formulae. An 
inequality X<Y {otX^ Y), where X and Y are infinite series, is 
always to be interpreted as meaning ‘if Y is convergent, then X 
is convergent, and X<Y (or Y)’. More generally, an in- 
equality of the type 

(6.1.1) Z<SAP...Z« i 

(or Z gSAP ... P), where Y , ..., Z are any finite number of 
infinite series, E is a finite sum, and A, b, ... , c are positive, is 
to be interpreted as meaning ‘if F, ... , E are convergent then 
Z is convergent, and Z satisfies the inequality’. Neglect of this 
understanding would lead to confusion when it is ‘ < ’ which 
stands in the inequality. We could read ‘F’ as ‘oo’ in the case 
of divergence; then ‘Z^oo’ would convey no information, but 
*Z < 00 ’ would imply the convergence of Z, and this implication 
would usually be false. 

Some inequalities will occur which are not of the form (6.1.1). 
These are usually secondary, and should be reduced to the form 
(6.1.1) if there is any doubt about their interpretation. Thus 

Z»<AF6 

should be interpreted as 

Z<A"'»F«« 

which is of the form (6.1.1); and Z> F should be interpreted 
as F<Z. 

There is one important inequality, viz. 

(6.1.2) Sa6 > (EaA)W* (Sd*")!/!^, 
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where ij<l, i + O,® which we have written deliberately in a 
form unlike (6.1.1). We might have written it as 

(6.1.3) Sa* < (Sa6)* 
when 0<h< 1, or as 

(6.1.4) < (Sa6)*' (Sa*)-^/* 

when i<0. These are of the form (6.1.1), and are the forms 
which arise primarily in the proof of Theorem 13. We prefer the 
form (6.1.2) for formal reasons, and because it shows clearly 
the contrast between the two cases of the theorem, but we 
must use the other forms if we wish to show explicitly and 
exactly the implications about convergence. 

There are a few cases where the inequality asserted is not one 
between infinite series but involves the results of other limit 
operations. Thus, when we extend the inequality @(a)<Maxa 
(Theorem 2) we obtain an inequality between an infinite product 
and the upper bound of an infinite set. Such an inequality 
'X < 7’ is to be interpreted in the same way, viz. as ‘if 7 is 
finite then X is finite and X < Y\ 

(2) The second remark concerns method^ and should be read in 
conjunction with §1.7. Suppose, for example, that we wish to 
prove the inequality (2a6)2 g 

for infinite series. We know the inequality for fimte sums 
(Theorem 7) and, everything being positive, our conclusion 
follows by a passage to the limit. 

We cannot extend Theorem 7, in its complete form, to infimte 
series in so simple a manner, since in the limiting process ‘ < ^ 
degenerates into ‘ ^ and we are unable to pick out the possible 
cases of equality. Here and elsewhere, we must avoid limiting 
processes; instead of deducing the infimte theorem from the 
finite one, we must verify that the proof given for the finite 
theorem remains valid, with that mimmum of change required 
by the new context, in the infimte case. For example, either 
proof of Theorem 7 given in § 2.4 may be extended to the infimte 
» (2.8.4) of Theorem 13. 


8-3 
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case by the addition of a few obvious comments concerning 
convergence. 

It will not be necessary to retrace the path followed in Ch. II 
systematically. The few new points which arise are neither 
difficult nor particularly interesting, and, in so far as thev are 
important, recur in a more interesting form in Ch. VI. Wejshall 
therefore arrange the substance of this chapter informally, illus- 
trating and commenting upon the new possibilities, and enmng 
with an enumeration of some of the more important theorems 
of Ch. II which remain valid with the new interpretation. \ 

6.2. The means 9)1,.. We begin by some comments on a new 
point which arises in the definition of the means 9Rr- We have 
now an infinity of terms a and weights p, and there are two cases 
to consider, according as Hjp is convergent or divergent. 

(i) If is convergent we may suppose that the sum is 1 and 
write q for p. In this case 2R,. is defined, for r > 0, by 

(6.2.1) 9)^,(a)=={S^a0'^^ 

andmay be regarded as a ‘mean’ in the sense of § 2.2 or a ‘ weighted 
sum’ in the sense of § 2.10 (iv). It is finite or infinite according as 
Sga** is convergent or divergent. 

(ii) If Sp is divergent, we can still define 9)1,. as a limit, e.g. by 

( n n \llr 

1 1 / 

or as the corresponding upper limit lim. The latter definition is 
not particularly interesting, though it would preserve most of 
our theorems. If we define 9)4 by (6.2.2), we are met by a 
difficulty : the existence of 9)1,. /or a given r does not ensure its exist- 
ence for any other r. In fact we can determine thea so that9K,. shall 
exist for a given set , rj , . . . , r„j of values of r and for no others. 
We shall therefore confine our attention to case (i). 

For the general question of the existence of 9)lrt sec, for example, 
Besicovitch (1). We may illustrate the point by showing briefly how to 
find a so that either of the limits 

fini- {oi+Of+.-.-f oj, lim;i(ai®-f a,*+...+a„2) 
n n 

may exist without the other: herep= 1. 
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Ti^e first two sequences 

®1 » * • * • ♦ t 0^1 » Otj • • • • • Pi 9 Pf f ••• t Pv > Pi * P% f • • • 

with period w. When a = a, both limits exist and have the values 


A ««« 4-a>gr 

^ 95“ * 


^ + ... + atr*. 

;;; » 


and when a = j5 they have the corresponding values Bit 
Now take the a as follows: 


cLit cL^i aw (repeated times), 

Pi* Pt* Pv (repeated Nji times), 

«i 9 « 2 » <3Cw (repeated N 3 times), 


It is easy to see that, by supposing the sequence Ni , NJNi , NJN ^ , ... to 
tend to infinity with great rapidity, we can make (%+ ...+a„)/n and 
( 01 * 4 - 4*On*)M oscillate between Ait Bi and A^t Bj respectively. The 
conditions for convergence will then be Ai = and B 3 respectively, 
and we can obviously choose the a and p so that either of these conditions 
shall be satisfied without the other. 

We therefore restrict ourselves to case (i). We define SHy, for 
positive or negative r, by (6.2.1), with the convention that 
90?,. =0 if r is negative and an a zero or Sga** divergent. We 
define @ (and 99lo) by 

(6.2.3) ® (a) = («) = na« = exp (Sgloga), 

with the conventions that @ = 00 if Ha® diverges to 00 (i.e. if 
llgloga diverges to + 00 ), and @} = 0 if Ila® diverges to 0 (i.e. if 
Sgloga diverges to — 00 ). It is to be observed that loga may 
have either sign, and that the definition of & fails if Sgloga is 
oscillatory. In this case @ is meaningless. 

It follows from Theorems 36 and 150 that 

(6.2.4) (0<r<«, <>0), 

unless < = 1, when there is equality. We define log + 1 and log" t by 
log+«=log« (<>1), log+<=0 (0<t^l), 

log"f=logi (0<f^ 1), log"<=0 (i>l), 

so that 

log+ 1 S 0. log- < g 0, log < = log+ 1 + log- 1, log - 1 = - log+ ~ . 
It then follows from (6.2.4) that 

O^S?log+ogls'? (o'-- 1) ^ 1), 
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where S' denotes a stmunation extended over the a which ez- 
oeedl. Hence, if 2R,(a) is finite for some positive s, then 
finite for 0 <r<«, and Sglog+ais convergent. We can prove 
sinularly that, if 99 l_,(a) is positive for some positive «, then 
9 Jl_,(o) is positive for — «< — r< 0 , and S 9 log~a is convergent. 
In the first case ® is positive and finite or zero, in the second 
it is positive and finite or infinite. If Sgloga oscillates xhen 
S 9 log+ a and 'Zq log~ a- are both divergent, and this is only 
possible when SK,(o) = oo for all positive r and 9R,(o)=aO fo^ all 
negative r. It is only in this case that the definition of ^(a) 
can fail. 

There is one new point which, as we shall see in § 6.9, affects 
the specification of the cases of equality in some of our theorems. 
This point arises from the possibihty that, when rgO, 
may be zero although no a is zero. If r > 0 then, as m Ch. II, 
SR, (a) can be zero only if (a) is null, in which case 9R, (a) = 0 for 
all r. But when r g 0 there is a difference. The 3R, (a) of Ch. II 
was zero, for such an r, if and only if some a was zero, and then 
SR, (a) was zero for all r g 0 . It is now possible, when r S 0 , that 
SR«(a) should be zero for 8<r and positive for e^r, or zero for 
a^r and positive for a>r. 

Thus in Theorem 1 there were two exceptional oases; 

Min a < SR,(o) < Max o, 

unless either all the a are equal, or else r < 0 and an a is zero. 
All that we can say now is 'unless either all the a are equal (in 
which case both inequalities reduce to equalities) or else r < 0 and 
SR, (a) = 0 (in which case Min o = 0 and the first inequality reduces 
to an equality) ’. Substantially the same point arises in connection 
with Theorems 2, 5, 10, 16, 24 and 25 (to quote only oases 
referred to in our summary in § 5.9). 

5.8. The generalisation of Theorems 3 and 9. We use 
the inequalities (6.2.4), and the equation 

hm “logt. 

r -*-0 
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Taking t=>afSqa^al^, r «s 1 in (5.2.4), we have 

logo-log 2lg^-l, 

log ® - log 21 =Sg (logo -log 31) g 1- 1 = 0, 

with equality only if every o is 2(. This proves the analogue of 
Theorem 9. 

Suppose now that is finite for some positive 8. Then is 
positive and fimte or zero: the proof below applies to either case*^. 
Given € > 0, we can choose N so that 

(5.3.1) 2 gloga<log(® 4 -€), 

n^N 

(6.3.2) S 

n>N ^ 

and then so that 0 < »•„ < « and 

(6.3.3) S q- — S gloga+£ 

nSN ^ n&N 

for 0 < r < fj. We have then 

log ® (o) = ^ log ® (o’") ^ ^ log 21 (or) g 

„ a’’ — 1 — a’^—1 XI, _a*— 1 

= S q h S q < S g'loga+€+ S q 

nS^N ^ n>N f »SiV n>N « 

< log (@ + e) + 2e. 

Hence log 911, (o) =i log 21 (aO-*- log® (a) 

when r-»- +0. We leave it to the reader to prove that, if 9R, is 
positive for some negative r, then 9)1, -»-® when r-»— 0. 

6.4. Holder’s inequality and its extensions. The proofs of 
Holder’s inequality, and other theorems of the same type, given 
in Oh. II apply equally to infinite series. We may observe in 
passing that the series may be multiple series. Thus 
(SSo^,6^,)»^SSo2,SS52,. 

* It is modelled on the proof by F. Riesz (7) of the oorresponding integral 
theorm: see § 6.8. 
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For example, suppose that and are convergent, and 
take 

Since SS {(i+v)~^~^ is convergent, it follows that 


SS— 

(n+v)^-^ 

is convergeqt. This is an imperfect form of a theorem to\be 
proved later (Theorem 315). \ 

The theorems concerning SJl, deduced from Holder’s inequality 
(Theorems 16 and 17) are unchanged, except that the statement 
of the second exceptional case of Theorem 16 must be modified 
in accordance with our remarks at the end of §5.2. Here we 
must say ‘unless «;g0 and S!)l,(o) = 0’. 

One new point of greater interest arises in connection with 
this group of theorems. There is a theorem, suggested by Theorem 
15 but not a corollary of it (even when it has been extended to 
infinite series), which has no analogue for finite sums. 

161 .5^ If k> 1 and Soft is convergent for all b for which S6*' is 
convergent, (hen So*' is convergent. 

We deduce this from another theorem due to Abel», which is of 
great interest in itself. 


162. If is divergent and 

■^n~®i+®a+ •••+*»»» 


* Landau (1)* ^ 

^ Abel (1). There are theorems of the same type involving an arbitrary function 
f{x). Thus, if is divergent, f{x) is positive and deoreasing, and 



then the convergence of I involves that of ha^f{A^), and the divergence of I in* 
volves that of see, for example, de la Vall^ Poussin (1, 398-399), 

Littlewood ( 1 ). This theorem, though of a more general character, does not actually 
include Theorem 162: it is not true that the divergence of I necessarily involves 
that of For an example to the contrary take 






so that A„-^oo, then 
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(i) S ^ is divergent. 


(ii) S - - is convergent for every positive 8, 

"n 

(i) We have 


*n+2^ _ .j. 




>4 — >1 >1 


•^n+l '^n+2 "n+r -^n+r ■^w4r 

which tenda to 1 when n is fixed and r-*-oo, and is therefore 
greater than | for any n and some corresponding r. This proves (i). 

(ii) We may obviously suppose 0 < 8 < 1. Then 

V V / ^ ^ \ 

A MS ~ \A S j 8/ 

-“»-l \-“n-l -“n / 

is convergent. By Theorem 41, the numerator on the left is not 
less than 8.4„s-i _ A,^_^ = 8a„4„s-i. It follows that 


V 

^A A S 

is convergent. We prove in fact a little more than (ii). 

To deduce Theorem 161, write 

o*'=tt, ab = uv, b^ — ui^. 

We then have to prove that, if is divei^nt, there is a 
such that is divergent and convergent. We take 

«„=l/t7„, where ?7 „=«i+M 2+. and the conclusion 
follows from Theorem 162. 


5.5. The means 9)1,. {continued). There is little to be added 
about the means 9)1,, but one or two further remarks are re- 
quired. We begin with a remark concerning the generalisation of 
Theorem 4. This theorem, in so far as it concerns positive r, 
must be interpreted as follows: ifiheaarebounded,and a*=Maxa 
is Uieir u/pp&r bound, then 

mr^m^=a*, 

when r-»« -1- 00 ; if the a are unbounded, but 9)i, is finite for aU positive 
r, then 9)l,->oo. 
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The question of the continuity of iDt,. for a finite positiYe or 
negative r is no longer quite trivial. We state a comprehensive 
theorem, but give no proof, since all the points involved arise in 
a more interesting form in Ch. VI (§§ 6.10-6.11). 

9)lr= 0 for all r (whether C be positive or zerq). We 
exclude this case, and also the case in which ® is meanii^less, 
when 2K,.=oo for r > 0 and S0?,= 0 for r < 0. We write 
S,(a)=log2)i,(a) 

(with the conventions logoo= +oo, logO= — oo). 

163. Apart from the cases just mentioned, the set of values r for 
which S,. (a) is finite is either the null set or a closed, half -closed, or 
open interval («, ®), where — +oo, which has r=0 os 
an internal or end-point, so that u^O^v, but is otherwise arbitrary 
(and in particular may include all real values or none). is + <x> 
to the right, and —co to the left, of J; is a continuous and strictly 
increasing function of r in the interior (if it exists) of I \ and teiids, 
when r approaches an end-point of I from inside I, to a limit equal 
to its wlue at the end-point. 

6.6. The sums ©, . The definition of given in § 2.10 is un- 
changed, and there is little to be said about the theorems concern- 
ing it, though those which involve continuity in r are naturally 
less immediate. Theorem 20 must beinterpreted asmeaning ‘if©,, 
is convergent for some (sufficiently large) r, then it is convergent 
for all greater r and ...*, and Theorem 21 asmeaning ‘if ©,.is con- 
vergent for all positive r (however small) then. . . ’ . The extension 
of Theorem 20 may be proved as follows. If ©jj is convergent for 
a positive R, then o„-»-0, and ©,. is convergent for r>B. There 
is a largest a, which we may suppose on grounds of homogeneity 
to be 1, and we may suppose the a arranged in descending order. 
If then 01 = 02 =. ..=o^=l>®jv+i» 

we have ©i.= (W-l-o^ 4 .iH-Ojy 4 . 2 -l- ..•)^» 

for r>R. The series here lies between 1 and 

■^+‘^+i+®*+2 + — » 

from which the theorem follows. 

There is one new theorem (trivial in the finite case). 
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164. J/©jj ia convergent, then ©, is eontiniumafor r> It and eon- 

tinuous on the right for r= R. is divergent, but convergent 

forr> R, then @,->oo when r-^R. 

The proof may be left to the reader. 

6.7. Minkowski’s inequaiity. The main arguments of 
§§2.11—2.12 require no alteration. Theorems 24—26 suggest a 
further generalisation, with both summations infinite. 

165. “ // r > 1 and is not of the form b„,c„ , then 

n m m n 

The inequality ia reversed when 0 < r < 1 . 

There is no real loss of generality in supposing p = 1, = 1, and 
the proof goes as before. Similarly, corresponding to Theorem 27, 
we have 

166. S > S 

n m m n 

if r>l (with a reversed inequality if 0<r<l), unless, for every n, 
= 0 for every m save one. 

5.8. Tchebychef’s inequality. As one further illustration 
we take Tchebychef’s inequality (Theorem 43). 

We may suppose Sp = 1. The identity 

shows, provided of course that neither (o) nor (6) is null, (i) that 
if (a) and (6) are similarly ordered then the convergence of 2pa6 
implies that of 2pa and 2p6, and (ii) that if (a) and (6) are 
oppositely ordered then the convergence of Spo and Sp6 implies 
that of Spoh. In either case we may put »=qo in the identity, 
and our conclusions follow as before. 

6.9. A s umma ry. The theorem which follows is substantially 
an enumeration of the principal theorems of Ch. II which remain 
vab'd, with the glosses which we have explained in the preceding 
sections, for infinite series. 


Her^ as in Theorem 26, we abandon onr usnal oonrention about g* 
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167. Theorems 1, 2, 3, 4, 5, 7, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, 22, 24, 25, 27, 28 and 43 remain valid 
when the series concerned are infinite^ provided that the ineqmlities 
asserted are interpreted in accordance with the conventions laid 
down in § 5.1, and that the statement of the eocceptional cas^ in 
Theorems 1, 2, 5, 10, 16, 24 and 25 is modified in the manner 
explained in § 6.2. \ 

It may be worth while to supplement the last clause of the theorem 
by a more explicit statement. The last words of the theorems mui^ ^ 
replaced by ‘or efee r<0 and S(n,(o) = 0’, \ 

(2) ‘or ®(o) = 0’, 

(5) ‘or and ailr(o) = ^’» 

(10) ‘or(2)(5(a + 6 + ... + Z) = 0’, 

(16) ‘or and 211,(a) = 0*, 

(24) ‘or r^O and 2R,.{a + 6H“... + Z) = 0*, 

(25) ‘or r<0 and (S(a + 6 + ...-f Z)’')^’'=0*. 

We may add also that (as is explained in § 6.4) most of the theorems 
referred to in Theorem 167 (especially those concerning SDIr) may be 
derived by specialisation from the corresponding theorems for integrals. 
In Ch. VI, however, we often ignore negative values of r. 


MISCELLANEOUS THEOBEMS AND EXAMPLES 

The theoroms which follow are for the most part connected with 
Theorems 156 and 157. We suppose in Theorems 168-175 that/(a:) and 
g{x) are inverse functions which vanish for a:= 0 and increase steadily as 
X increases, and that 

Fix)=lj{u)du, 0{x)=ly(t)dl. 

168. If SE(an) and SG'(6n) axe convergent, then La„&„ is convergent, 

“d • Sa,6,gLf’(aJ + 2:(?(6.). 

[Corollary of Theorem 156.] 

169. If Sa„/(a„) and (6«) are convergent, then Sa„6„ is convergent, 

2:a,6,^Sa„/(o,) + 2:6,y(6,). 

[Corollary of Theorem 157,] 

170. It is possible to choose / (and so g, F, O) and in such a manner 

that 21 ^( 0 ,,) is divergent, but convergent for all 6„ for which 

S<7(6n) is convergent. 

171. It is also possible to make San/Ca^) divergent, but aon- 

vergent whenever S&„^(6„) is convergent. 

[The point of the last two theorems is to show that Theorems 168 
and 169 have no converses in the sense in which Theorem 161 gives a 



INFINITE SERIES 


125 


converse of HSlder’s inequality and the convergence test deduced from it. 
Theorem 171 is proved by Cooper (3), and Theorem 170, which includes 
Theorem 171 and is a little stronger, may be proved in the seune manner.] 

172. If £ iQg^iji^ ^ ^ convergent, then S convergent. 

[Cooper (3): Theorem 172 is used in Cooper’s proof of Theorem 171.] 
178. If g(x) satisfies the inequality 

g(3sy)^g{x)g(y), 

and if J^n^n i® convergent whenever is convergent, then 

'SiOnfian) is convergent. Similarly, if is convergent whenever 

LG'(6„) is convergent, then l,F{an) is convergent. 

[See Cooper (3) for the first form which in this case is stronger, the 
second form being a corollary.] 

174. If San&n i® convergent whenever S(?(6„) is convergent, then there 
ts a number A =A(a), depending upon the sequence (a), for which U^{Xan) 
is convergent. 

175. If the conditiona of Theorem 174 are satisfied, and ^ 

for small a;, a c> 1, and some k, then is convergent. 

[For the last two theorems see Bimbaum and Orlicz {!).] 

176. If On and tend to zero, k is positive, and 


Si 


'logd’K)’ 

are convergent, then is convergent, 

[Use Theorem 169.] 




177. If a;>0, a„^0, and — then f(x) is a convex function 

of X and log/(a;) of logic. 

[Plainly /"(a?) ^0. To prove the second result, lot a? = e~*',/(ic)=sflr(y). 
Then £„2a„e-«» - {2»»a„e-»»)= S 0, 

by Theorem 7. The result follows from Theorem 1 18.] 


178. If On^o, An>An.«i^0, and /(jc) = then log /(a:) is a 

convex function of x. 


179. If a,|> 0 and A«, a?, y, ... , s are real, then the domain/) 

of convergence of the series 

=/{a;, 2) 

is convex, and log/ is a convex fimction of a?, y, . . . , 2 in U. 

[Beoadse (by the extension of Theorem 11 to infinite series) 

f{xit-i^x^{l-i),...,Zit+z^(l’-t)}^{f{Xx,...,Zi)Y{f{Xt «*)}^-** 

Here our conventions concerning convergence are important.] 

180. Sa,*<2(Sn*o„*)*(S(an-a„+i)*)*. 

unless a„ ss 0 for all n. 

[See Theorem 226.] 
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INTEGRALS 

6.1. Preliminary remarks on Lebesgue integrals. The 
integrals considered in this chapter are Lehesgne integrals, mcept 
in §§6.16-6.22, where we are concerned with Stieltjes intej^ls. 
It may be convenient that we should state here how much mow- 
ledge of the theory we assume. This is for the most part Very 
little, and all that the reader usually needs to know is that there 
is some definition of an integral which possesses the properties 
specified below. There are naturally many of our theorems which 
remain significant and true with the older definitions, but the 
subject becomes easier, as well as more comprehensive, if the 
definitions presupposed have the proper degree of generality. 

We take for granted the idea of a measurable set, usually in one 
but occasionally in more dimensions. The sets which we consider 
may be bounded or unbounded. The definition of measure applies 
in the first instance to bounded sets: an unbounded set is said to 
be measurable if any bounded part of it is measurable, and its 
measure is the upper bound of the measures of its bounded 
components. 

We shall generally assume, without special remark, that any 
set E with which we are concerned is measurable. We denote the 
measure of E by mE or sometimes, where there is no risk of 
ambiguity, simply by E. If E is unbounded, mE may be oo. 

We also take for granted the idea of a measurable function. 
Sums, products, and limits of measurable functions are measur- 
able. All functions definable by Uie ordinary processes of analysis 
are measurable, and we shall confiine our attention to measurable 
functions; we shall not usually repeat e3q>lioitly that a function 
which occurs in our work is assumed to be measurable. 

Next, we take for granted the definition of the integral, of a 
bounded or unbounded function, over any (bounded or un- 
bounded) interval or measurable set of points. A bounded measur- 
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able fonotion ia integrable over any bounded measurable set. 
We call the class of (bounded or unbounded) functions integrable 
over the interval or set E in question the class L or, if it is desir- 
able to emphasise the set in question, the class L{E).l£f belongs 
to L, we say that ‘/is i’, and write/ e L (and similarly for other 


If/=1 then 



mE, 


If/ e L, then |/| eZr. If/+ and/" are the functions equal to/ 
when / is positive and negative respectively, and to zero other- 
wise, so that 

/+=Max(/,0), /-=Min(/,0), /=/++/-, |/| =/+-/", 

then/+ € L and /~ € L, and® 

Udx=U+dx-¥if-dx, i\f\dx=U*dx-lf-dx. 

If /fe 0, and (/)„ = Min (/, n), then 

ifdx= lim/(/)„da: 

n->-oo 

(substantially by definition). 

lif e L, and (gr is measurable and) I? | <C?|/|, thengr e L. 

If/1,/2 /ncZthen 

J K/i + Oj/a -t- . . . -H o„/„) da; = a J/ida: + o J/gda: + . . . + a„;/„da;. 

Ifp > 0 and (/is measurable and) |/ 1** e Z, we say that/belongs 
to the Lebesgue class Lfi, or /e IP. These classes are most im- 
portant when p ^ 1. IA is identical with Z. 

If the integration is over a finite interval (or bounded set), then 
LP includes every Lfl for which q> p\ f e IP implies felP, A 
bounded function belongs to every Z«. These propositions are not 
true for an infinite interval; / may belong to IP, in (0, co), for one 
value of p only. 

If the interval is finite suid ftL^, j><?, then 1/1’’< 1 + 1/1*, so that 
ftLP. 

If the interval is (0, a), where o< 1, then (o) belongs to for 
every 8>0, but not to (6) (log * belongs to L», but not to 


• We state the resolts for one variable and omit explicit reference to the range or 
ckf integration. 
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Xr*»+«; (c) log (1/aj) belongs to every and (d) belongs to no LK If 
the interval is (0, oo), then (1 + ] log a? |)-^ belongs to L* but to no 
other class L*. 


6.2. Remarks on null sets and null functions. A set of 

measure zero is called a nuU set. Null sets are negligible/ in the 
theory of integration, lif—g except in a null set, we saty that 
/and g are equivalent, and write /=gr. Equivalent function^ have 
the same integral (if any). 

If /s 0, we call / a null function, and say that / is null. 

Similarly, we define ‘ equivalent inE',* null in -B ’ :/is null ik E if 
/= 0 at all points of E except the points of a null set. In such cases 
we shall not repeat the reference to E when the context makes it 
obvious, as for example when we are considering integrals over E, 

If a property P (x) is possessed by all x except the a: of a null set, 
we shall say that it is possessed by almost all x, or that P (x) is 
true for almost all x, or almost always. Thus a null function is 
almost always zero. 

We shall generally assume that our functions f,g, are almost 
always finite ; but there will be occasions when we have to consider 
functions infinite in a set of positive measure. Thus if /is generally 
positive, but zero in a set E of positive measure, and r < 0, then we 
must regard/** as infinite in E, and J/*'dx as having the value oo. 

If E is null then r 

I fdx=^0 

JE 


for all /. We shall assume without special remark that a set E 
over which an integral is extended is not null. 

then a necessary and sufficient condition that J fdx — 0 is 
that f should be null. 

It may be worth while to call attention explicitly to the theorem which 
replaces this in the theory of Kiemann integration. We denote the class 
of Itiemaim integrable functions by B (it is contained in L). A necessary 
and sumcient condition that / should be P is that / should be bounded 
and that its set of discontinuities should be null. 

Jf f ia R, and /^O, then a necessary and sufficient condition that 
J/da? = 0 w thatf:s^ 0 at aU points of continuity off. 

For ( 1 ) if the condition is satisfied, then/ = 0 and so ffdx = 0. And (2) if 
it is not satisfied, then there is a point of continuity ( at which f(() > 0, 
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and an interval including f throughout which so that 

//da?>0. 

This theorem enables us to specify the cases of equality in our in- 
equalities when they are restricted to functions of B. In fact most of our 
theorems have a dual interpretation. The primary interpretation is that 
in which the integrals are Lebesgue integrals and ‘null’ and ‘equivalent’ 
are interpreted as in the theory of Lebesgue. In the secondary inter- 
pretation integrals are ‘ Riemann integrals a ‘ null function ’ is a function 
which is zero at all its points of continuity, and ‘equivalent fimctions’ 
are functions whose difference is null in this sense. 


6.3. Further remarks concerning integration. What has 
beensetoutin §§ 6.1 and 6.2isasufficientfoundationfor most of the 
subsequent theorems: for example, for the most complete forms 
of Holder’s and Minkowski’s inequalities (Theorems 188 and 
198). There will be a few occasions on which we shall have to 
appeal to more difficult theorems, and we enumerate these here. 

(a) Integration by parts. The theorem required is: iff and g are 
integrals {absolutely continuous functions), then 

Vfg'dx=\fgl- ['f'gdx. 

J a L Ja J a 


(6) Passage to the limit under the integral sign. The two main 
theorems are 

(i) If 1 «„ (jc) 1 < <^ (®), where <f>€L, and «„ (*) tends to a limit s (x) 
for all or almost all x, {hen 

JSn(x)dx-*-js{x)dx, 

(ii) If s^ {x) eLfor every n, (x) increases toUh nfor aU or almost 

aU X, and (j.) =8(x), 

then jSn{x)dx-*-j8(x)dx. 

In (ii) the integral on the right may be infinite, when the result 
is to be interpreted as ^8j^{x)dx-*(X); in particular this happens 
if « (x) = 00 in a set of positive measure. In each of these theorems 
n may be an integer which tends to infinity or a continuous 
parameter which tends to a limit. 


It follows from (i), as is shown in books on the theory of functions of 
a real variable, that a function /(x) whose incrementary ratio 

I(x+h)-Hx) 


HI 
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is bounded (and which therefore has a derivative almost everywhere) is the 
integral of its derivative. Combining this remark with that at the end of 
§ 3.18 we see that a continuous convex function f(x) is the integral of its 
derivative /'(oj), or of its one-sided derivatives// (a:),/,.' (a;). It is therefore 
the integral of an increasing function. On the other hand, if S(x) is the 
integral of an increasing function g(x)^ and A> 0, then j 

/x+h tx I 

/(a;-f A)-/(a ;)= j ^ g(u)du'^ j ^g{u)du=f(x)-f{x--h)\ 

so that f(x) is convex. Hence the class of continvaus convex June 
identical with that of integrals of increasing functions. \ 

An increasing fimction belongs to R, so that the integrals in question 
exist as Riemann integrals, and the theorem could be proved without 
any use of the theory of Lebesgue. 

(c) Svbatitution. The standard theorem is: if f and g are in- 
tegrablCf 0 is an irdegral of gr, and a=^0 (a), b^O (j3), then 

(6.3.1) (* f(x)dx=‘ r f{G{y)} 9 {y)dy. 

J a J a 

Here any of a^ 6, a, j8 may be infinite. 

This theorem covers all cases in which we shall require the rule 
for transformation of an integral by change of the independent 
variable®. But we shall generally use only trivial substitutions 
such QA x^y'^a ov x^ay, when the validity of the rule follows 
at once from the definitions. 

(d) Multiple and repeated integrals. The only theorem appealed 
to is ‘Fubini’s Theorem*. If f{x,y) is (areally measurable and) 
non-negative, and any one of the integrals 

f"" fdxdy, \^dx T/dy, \^dy fdx 

I a V b J a J b J b Jo 

exists^ then the other integrals exist, and all are equal. Here the 
limits are finite or infinite, and the case of divergence is included; 
if one integral diverges the others diverge. 

• We may add two additional remarks concerning the formula (6.3.1). 

(1) If we suppose, as in the text, that g is non-negative and Integrable, but 
assume only the measurability (and not the integrability) of /, then the existence 
of the right-hand side of (6.3.1) is a sufficient, as well as a necessary, condition 
for the existence of the left-hand side, i.e. for the integrability of /. 

(2) The integrability of /(*), though it implies that of f{Q(y))g(y)f does not 
imply even the measurability of / {Q (y)}. 
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Suppose then that f{x,y) is measurable and non-negative. 
The double integral is zero if and only if }(x,y) is null, i.e. if 
the set in which / (x, y) > 0 has measure zero. The first repeated 
integral is zero if and only if /(*, y) is, for almost all x, null in y; 
and the second if/ (*, y) is, for almost all y, null in x. Hence these 
three senses of ‘ a nuU non-negative function of two variables ’ are 
equivalent. 

6.4. Remarks on methods of proof. Inequalities proved for 
finite sums may often be extended to integrals by the use of 
limiting processes, but something is usually lost in the argument. 
We may Uliistrate this by considering the analogue for integrals 
of Theorem 7. 

Suppose first that/(x) and g (x) are non-negative and Riemann 
integrable in (0, 1); and take 



in Theorem 7. Dividing by we obtain* 



and, making 

(6.4.1) fgdxj g j ^f^dx j ^g^dx. 

If we use the Lebesgue integral we must argue differently**. 
Suppose that / and g are non-negative and L* in (0, 1), and that 
^ is the set in which 


— (r,s=1.2,3,...). 

n n n n 






by Theorem 7. Now 


‘ It is here that ‘homogeneity in S’ (§ 1.4) is essential. , ,, tt n 

^ The precise form of argument used here was suggested to us by Mr H. D. Urselu 

9-a 
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and there is a similar inequality involving g. Henoe, making 
«->oo, we obtain (6.4.1). 

In either case our final result is imperfect. Even if we can use 
Theorem 7 with ‘ < this will degenerate into ‘ ^ ’ when we pass 
to the limit, and we shall lose touch with the cases of equality. 

The passage in the opposite direction, fix>m an integim in- 
equality to an inequality for sums, is much simpler, and c^ be 
effected by suitable specialisation. Consider, for example! the 
inequality 

(6.4.2) /(*)<^ 

(§6.7, Theorem 184). If 

g„>0, gi+? 2 +- + ?«=!. 
and we define f(x) by 

/(a:)=o„ (?i+... + g._i^a5<?i+... + g,-i+gv). 
it being imderstood that ... + gv-i nieans 0 when v= 1, we 
obtain Theorem 9. The conditions imder which inequality de- 
generates into equality in Theorem 9 also follow immediately 
from the corresponding conditions for (6.4.2). 

This method of proof is often useful, since integrals are often 
more manageable than series. We shall meet with examples in 
Ch. IX. 


6.6. Further remarks on method: the inequality of 
Schwarz. We meet the difficulty of § 6.4, as in our treatment 
of infinite series, by going bcusk to the proofs of the theorems of 
Ch. n, and observing that, with the obvious changes, they can 
be applied to integralsof themost general type. We may illustrate 
the point here by considering ‘Schwarz’s* inequality’, the ana- 
logue of Theorem 7. 

181. (J‘/flrda:)*<//®da:/g*d*, unless Afs Bg, where A and Bare 
constants, not both zero. 

Here, and later, we suppress the limits of integration when 
there is nothing to be gained by showing them explicitly; they 
• Or Bttniakowsky’a (see p. 16 ), 
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may be finite or infinite, or the integrals may be over any measur- 
able set E, in which case of course Afs Bg means Afs Bg in E. 
We also adopt conventions corresponding to those of §6.1: 
‘X < Y’ means ‘if F is finite then X is finite and X<T’; and 
inequalities of other forms like those mentioned in § 6. 1 are to be 
interpreted similarly. Thus every inequality contains implicitly 
an assertion about ‘convergence’, which we shall only make ex- 
plicit occasionally. For example. Theorem 181 asserts implicitly 
that ‘if iPdx and are finite, then ^fgdx is finite; if / and g 

are 2i*, then, fg is L\ 

The proofe corresponding to those of § 2.4 run as follows. 

(i) We have 
J/*da:/ g^dx- (ifgdxf 

= g*{x)dxlP(y)dy 

-if(x)g{x)dx]S(y)g{y)dy 

= i J ^yl {f(^) g(y)-g {x)f{y))*dx ^ o. 

It remams to discuss the possibility of equality. In the first 
place, there is certainly equality if AfsBg. Next, if there is 
equality, imd g is null, then AfsBg with ^4 = 0, .B = 1. We may 
therefore assume that g is not null, so that the set E in which 
9r(y) 4:0 has positive measure. If 

/ <*y/ {/(«) g^)-g (»)/{y )}*«^ = ^ 

then 

(6.6.1) i{f{x)g{y)-g{pc)f(y)}*^=^ 

for almost all y, and therefore for some y belonging to E. We 
may therefore suppose that $r(yQ) + 0 and that (6.6.1) is true for 

/(®)flr(y,)-?(*)/(yo)=0 
for almost all x, and this completes the proof. 

(ii) The quadratic form 

is positive. We can now complete the proof as in § 2.4. 

The . analogue of Theorem 181 for multiple integrals may be 
proved similarly. We shall not usually mention such extensions 
explicitly, but we shall occasionally take them for granted. It 
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is to be understood that, when we do this, the extension may be 
proved in the same manner as the original theorem. 

We can translate the proof of Theorem 8 in the same manner, 
and so obtain 

182.* Sf^dx ifgdx ffhdx >0, 

Jhfdz ^hgdx ... Jh^dx 

unless the functions fy g, h are linearly dependent, i.e. uMcss 
there are constants A, B, ..., C, not all zero, stick that 

Af + Bg + . . . + Ch =0. 


MEANS 8D^(/) 

6.6. Definition of the means when r4:0. In what 
follows the sign of integration, used without specification of the 
range, refers to a finite or infinite interval (a, 6) or to a measurable 
set Efi f(x) is finite almost everywhere in E and non-negative; 
p (*), the ‘weight function ’ , is finite and positive® everywhere in 
E, and integrable over E. The parameter r is real and not zero. 

Our hypotheses involve 0<Jpdaj<oo. It is often convenient 
to suppose lpdx=\: 

in this case (cf. § 2.2) we write qioxp. 

We write 

(6.6.1) aR,(/) = !0i,(/,p) = (^^)'^ (r+0). 

(6.6.2) Sl(/) = aRi(/), 

BO that 

(6.6.3) 

with the following conventions. If J pf'dx is infinite, we write 
iprdx=oo, SUi,(/)=oo (r>0), aR,(/)=0 (r<0). 

• Gram (1). 

^ When f > 0 we can reduce every case to that of the interval ( - co, ao)» by 
supposing /sO in the set complementary to E, 

0 The hypothesis p ^ 0, instead of p>0, would lead to slightly difEerent results 
concerning the oases of equality (e.g. pf^pG instead of /=(7). This case could be 
reduced to the apparently more special case by replacing E by the sub-set of E in 
which |i>0. 
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In partioulax 2Hy(/)=0 if r<0 and /=0 in a set of positive 
measure. If we agree further to regard 0 and oo as reciprocals of 
one another, we have 

This formula enables us to pass from positive to negative r, and 
we shall simplify the following theorems by restricting ourselves, 
for the most part, to positive r. 

If/=0, iDl,{/) = 0 for all r. If/=C, where C is positive and 
finite, then 9K, (/) = C for all r. K/= oo,» then 3)1, (/) = oo for all r. 
Apart from these oases, 3Jl,(/) = oo is possible only when r>0, 
and 9JI, (/) = 0 when r < 0. 

We define Max/, the ‘effective upper bound’ off, as the largest 
^ which has the projwrty : 

‘if « > 0, there is a set e (e) of positive measure in which /> ^ — e’. 
If there is no such $, we write Max/=oo. For functions con- 
tinuous in a closed interval, Max/ is the ordinary maximum. 
Min /is defined similarly; Min/^ 0 and 

( 6 . 6 . 5 ) 

Equivalent functions have the same Max and Min. 

Suppose for example that the range of integration is (0, oo), and that 
f(x) and q{x) axe the step functions defined by 

f{x)=an^ q{x)-qn (n-lga;<n, n= 1, 2, 3, ...). 

Then 9W,(/) = {ZqarYf^ = 9[Rr (o)t 

according to the definition of (5.2.1). Similarly 

Max/=:Maxa, Min/=Mino, 

and (if we anticipate the definition of § 6.7) (5 (/) = (5 (a). This specialisa- 
tion enables us to include many theorems of Ch. II and Ch. V in the 
corresponding theorems of this chapter. 

Alternatively we might (as in § 6.4) suppose that the range of integration 
is (0, 1), and define /(a;) and q(x) by 

/(a?)=a« (gi + ...+gn-i^»<^i+-+«n)» 

In this case also Sllr(/) reduces to 

^ To admit this case is to abandon momentarily the understandi n g of § 6.2, that 
/ is assumed to be finite almost everywhere. 
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188. If and Wtr(J) is finite and positive, then 

Min/<aR,(/)<Max/, 

unless fsG. 

Here r is of either sign. The proof is like that of Theor 
Suppose first that r= 1. Then, using a weight function g(a!(), we 

!q(f-mdx=0. 

Hence either /s 91 or/- 91 is positive and negative, each in ^ set 
of positive measure. This proves the result for r® 1, and we extend 
it to the general case by use of (6.6.3). 

Jf we wish to state Theorem 183 in a form corresponding more 
exactly to that of Theorem 1 and its extension in Ch. V», we 
must say ‘Min/<3Rr(/)<Max/ unless f^G or else r<0 and 
2IJtr{f) — 0\ We have then two cases of equality corresponding 
exactly to those distinguished in §5.2, the ^primary* case in 
which fs G, in which both inequalities reduce to equalities, and 
the ^secondary* case, occurring only for r<0, in which one 
inequality only reduces to an equality. This distinction recurs 
in many of our theorems, when r < 0, as it recurred in Chs. 11 
and y ; but it is less conspicuous here because we often ignore 
negative values of r. 


6.7. The geometric mean of a function. We define the 
geometric mean ® (/) by 

(6.7.1) = 
or 

(6.7.2) log ® (/) = at (log/), 

so that, in particular, ifp«g,/gda?= 1, we have 

(6.7.3) 3 (/) = log @ (/) « J qlogfdx. 

Certain preliminary explanations are necessary. 

Since log /is not necessarily positive, the possibilities concem- 
ing the convergence of Q are more complex than those which we 
have considered hitherto. 


Sees 5.2. 
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If we denote by 3f+ and Qf- the integrals formed with log+/ and 
log" /, as is formed with log/,* then there ate fonr possibilitieB: 
(a) 3+ and l>oth finite, (6) 3+ finite, 3" = -oo. (c) S'*" = 00 , 
3” ^te, (d) 3'*' = 00 , 3~ = — 00 . The four oases are exemplified 
by the functions 



in (0, 1), with q(as)=l. 

If fDtr(/) is finite for some r > 0, then, since 

log+fg Max , oj , 

3'*'(/) will be finite, and we shall be concerned only with oases (a) 
and (6). In case (a), 3 (/) exists as a Lebesgue integral, and ® (/) 
is positive and finite. In case (b) we write 

3(/)=-«>. ®(/)=o. 

Similarly, if Tlr(l//) is finite for some r > 0, we are in case (a) or 
case (c); in the latter we write 


3(f) = °o, ®(/) = oo. 

In case (d) the s 3 aubol ® (/) is meaningless. In this case 911, (f) 
and 211,(1//) are infinite for every positive r, and 21l,(/) = 0 for 
every negative r. 

In case (a) we have 

(6.7.4) 

both sides being positive and finite ; and a moment’s consideration 
shows that this equation holds in all oases, if we adopt the same 
convention about 0 and 00 as in (6.6.4), and Hie additional con- 
vention that one side of the equation is meaningless if the other 
is meaningless. 

We now prove the analogue of Theorem 9. 




184 . If %{f) is finite then 
(6.7.6) ©(/)<«(/). 


• See { 5 for the definitions of log+ end log~* 
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urdew fsO, where 0 ia eonatant. More generally, if 9Kr(/)» 
where r>0, ia finite, then 

(6.7.6) @(/)<aR,(/). 

wUh die aame reaervation^. I 

Suppose first that r=l, SDlr=2[. If 9l(/)=0, /aO, and so 
3(f)= — 00 , ®(/) = 0=3l(/). Wemaythereforesuppo8e2l(/|>0. 
Since, by Theorem 150, \ 

(6.7.7) log*<<— 1, \ 

if < > 0, 1+ 1» we have 

log/- log n(f)S - 1, 

a(log/)-log2l(/)g||^]-l = 0. 

log@(/) = 9l(log/)glogSl(/). 

Equality can occur only if /s St (/). 

The result for general r now follows from (6.6.3)* 

In Theorem 184 we have stated the hypotheses *if 9l(/) is 
finite*, ‘if SJIrC/) is finite* explicitly. As we have explained in 
§§ 6.1 and 6.6, we shall often save space by omitting such hypo- 
theses in accordance with our conventions. We shall also denote 
constants by (7, A, jB, a, 6, . . . without explanation, when there is 
no danger of ambiguity. Two (7*8 occurring in the same con- 
nection will not necessarily be the same. 

We add two corollaries (extensions of Theorem 10). 

186. <5(/) + (5(gr)<(5(/+fir), unless Af=B^, where A, B are not both 
zerot or 

We may suppose that (5 (/+ g) > 0. Then, by Theorem 184, 

The addition of the two inequalities of this type gives the result. 

More generally 

186. (5(A)+®(/,) + (5(/,) + ...<(5(/i+/,+/t+...) 

(the eeriee hemg finite or infinite), unless /„= (7,iS/„ ^ ®(2J/n)=»0. 

* For the proof which follows see F. Biais (7). 
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6.8. Further properties of the geometric mean. Onr 
next theorem corresponds to Theorem 3 (for positive r). 

187. If 9Jlr (/) is finite for some positive r, then 

(6-8.1) 3Kr(/)->@(/) 

when 0. 

It should be observed that ©(/) may be finite even when 
SJlr (/) = 00 for all r > 0. This is so, for example, if f(x) = exp {«“*), 
q{x) = l, and the range is (0, 1). 

When E ia a, closed interval or set, and / is continuous and 
positive, the proof is immediate. In this case /^8>0, log/is 
bounded, and 

SKr' = J ^ gd* = ]■ [1 + r log /+ 0 {r® (log /)*}] qdx 

= l + r;3+0(r®), 

Urn log 9DI, = lim i log {1 + r:3 + ^ = S- 

There is some difficulty in extending this argument to the 
general case. The difficulty can however be avoided as follows®. 
By (6.7.6) and (6.7.7), we have 

(6.8.2) log@(/) glog2R,(/) = Jlog2l(/0 

When r decreases to zero, (f — l)/r decreases (byTheorem 36) and 
tends to the limit log t. Hence^ 

(6.8.3) lim i {9t (/«•) - 1} = 91 (log/) = log ® (/), 

the right-hand side being finite or —co. Combining (6.8.2) and 
(6.8.3) we see that 

log @ (/) ^ Hm log Ttr{f)i ES log 2)lr (/) ^ log ® (/), 
which proves the theorem. 

6.9. Holder’s inequalityforintegrals. Weconsidernextthe 
• P. Biesz (7). Other, less simi^ proofs have been given by Besioovitoh, Hardy, 
and Littlewood: see Hardy (7). 

*> See §6.3(6) (ii). 



140 


INTBOBAIiS 


theorems for integrals which correspond to Theorems 1 1-15. It is 
convenient to introduce another definition which enables us to 
shorten our statements of cases of equality. Two functions /, g 
will be said to be effectively proportional if there are cons^tants 
A, By not both zero, such that AfsBg. The idea has oodorred 
already in Theorems 181 and 185. A null function is effecttvely 
proportional to any function. We shall say that /, g, h, ..\ are 
effectively proportional if every pair are so. \ 

188. If (X., p, ...,X are positive and a. + p+...+X=l,thm \ 

(6.9. 1 ) ;/“/ ...l^dx< (J/da:)“ (J gdx )^ . . . (/ 

unleae one of the functions is null or all are effedively proportional. 

Assuming no function null, we have, by Theorem 9, 




if-gP... lHx iJLYdx 

)'^lfgdxf ...{jldxf' J\j/da;/ \igdx) 


with inequality unless 


^J( 


jfdx^igdx^ 




dx= 1, 


/ _ =.J_ 

ifdx fgdx ]ldx' 


As a corollary* we have 


189. If k> I then 

(6.9.2) ifgdx< (//*da:)V* {^^dx)^’^ 
unless and g^ are effectivdy proportional. 

If 0<k<l or k<0 then 

(6.9.3) ifgdx>lff’‘dx)^'‘ijg*^dx)^*' 

unless either (a) /* and ^ are effectively proportional or (b) fg is 
null. 

The second half of the theorem requires a little explanation. 
Suppose first that 0<ik< 1 and that i^dx is finite, so that g 
is almost always positive. If then we write Im^l/k, so that 
Z>1, and f={uvY, g-v-*, 

so that fg==u*, /*««v, 


* Compare § 2*8. 
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then tt and v are defined for almost all x, and 
Suvdx<Uu‘dx)Vf(fv>’dx)V>' 
or SPdx < (ffgdx)>‘ (Sgf^dx)^-\ 

unless «*, v’' are effectively proportional or, what is the same 
thing, unless /*, are effectively proportional. Since Jg^dz ia 
finite and not zero^ this is (6.9.3). 

If J'gf*^da:=oo, then 

(since i' < 0). Hence the right-hand side of (6.9.3) is zero, and 
there is inequality unless ffgdx=0, at fg is null. 

When i: < 0, 0 < i' < 1, and the argument is substantially the 
same. 

As we have explained in §§6.1 and 6.6, the theorem contains 
implicitly an assertion about convergence or finitude; if two of 
the integrals involved are finite, then so is the third. The integral 
which is finite if the other two are finite is ^fgdx when jfc> 1, 
if^dx if 0 < 1; < 1, and Jgr^'da: if ifc < 0. 

The theorem corresponding to Theorem 161 is very important, 
and, like Theorem 161, is not a direct corollary of preceding 
theorems. 

190 .*> Ifk>l and fg belongs to Lfor every g which belongs to L*', 
then f belongs to L*. 

We consider fibrst the case in which (a, 6) is finite (or mE finite), 
and suppose that//*tia:=oo. We can find a function/* which 
(1) has only an enumerable infinity of values o<, and (2) satisfies 
f*^f<f* + €. Since /* does not exceed a constant multiple of 
t>y Theorem 13, we have J‘/**'da:=oo. Hence, if 
c< is the set in which/* = , 

Sa<*c^=QO. 

It follows from Theorem 161, taking 

* Jf*" (foaO would involve *0 end so gmao, and this possibility is excluded 
by the undexstandlng of { 6.2. 

** F. Biess (2). 
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that there is a 6^ such that is oonveigeiit and asoo. 

We take g (z) — bfiaef (for all t). llxen 

Jg«^da;=S6,fc'e* 

is convergent, but 

// Vda;= Sa<6,c< = 00, 

and hence J/g'da;= 00 , contrary to the h 3 rpothesis. 

If the integrals are over an infinite range, say (0, oo), can 
write 



when 

f"/fird»= r^Odt, ("f^dz^ r^^dt, f Ccf^dt, 

Jo Jo Jo Jo Jo Jo 

where 

The theorem is thus reduced to the finite case. 

191. If k>l, then a necessary and sufficient condition that 
J/^da; is that ^fgdz^ F^'‘ Q^’^for ail g such that J gr*'da: ^ 0. 

The condition is necessary, by Theorem 189. K it is satisfied, 
then//*da: is finite, by Theorem 190. If J/*da: > F, we choose g 
so that g^ is effectively proportional to /*, and then, by Theorem 

ifgdz= (J/*da:)V* (j gVdz)V’^ > fV* 0^^. 

The theorem may also be stated with ‘ ’ for ‘ ^ ’ in the first 

two inequalities: in order that ff^dx<F, it is necessary and 
sufficient that ifgdx< 0^^ whenever J g^dx ^ G. 

We can prove Theorem 191 without appealing to the more difficult 
Theorem 190. If J/*dsc>JP then J{f)n*dx>F for sufficiently large n. 
TheU) choosing g effectively proportional to (/«)*""*» we have 

in contradiction to the hypothesis of the theorem. 

Another proof of Theorem 190 (and of the associated Theorem 161) 
has been given by Banach (1, 86-86). 

An example of thb use of Theorem 191 appears in § 6. 13, in the proof of 
Theorem 202, and others in Ch. IX.» In § 6. 13 Theorem 202 is proved in 

* See in particular f{ 9.3 and 9.7 (2). 
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two different ways, of which one depends explicitly on Theorem 191 while 
the other does not use it, and the logical status of Theorem 191 in proo& 
of this character is explained in detail. 

6.10. General properties of the means SR, (/). Weshallnow 
prove a number of theorems which include the analogues of those 
of § 2.9. The properties to be investigated are a little more com- 
plex than they were there, and we shall require some additional 
conventions before we can state them comprehensively. We 
suppose first that r > 0; the theorems which we prove in this case, 
with those which we have proved concerning ® {/), will give us 
the substance of what is required, and we shall be able to state 
the results for unrestricted r more summarily, leaving most of 
the details of verification to the reader. 

192. If 0<r< sand 3}tt is finite, then 

9R,<aR„ 

unless /= C. 

If r = «a, so that 0 < a < 1, we have, by Theorem 188, 

/ qfdx=i (/ qfdx)«{i qdx)^-‘= (J qf’dxY, 

unless j/* s Cq. Since g > 0, this is the result required. 

193. IfSItf is finite for every r, tiien ^f-*-M.axf when r-*-+co. 

(i) Suppose /i= Max/ finite. Then (a) and (^) />/*-« 

in a set e of positive measure so that 

J qdx=i>0, KmaR^^/x-e. 

(ii) Suppose Max/*® oo. Then, for any C?>0, />(?ina set e of 
positive measure, and, as above, hmSR, S 0. 

From (6.6.4), (6.6.6) and Theorem 193 it follows that 

aR,.-»-Mm/ 

when r-^— 00 . 

194. If0<s<oo and HR, is finite, then SIR, is continuous for 
ti<r<sand eontinitous on the left for r<=8. If SIR,=oo, but 3R, is 
finite for 0<r<s, then'SH^-^co when r-*-s. 

(i) Suppose aR, finite. Then 

qfr ^qUaxil, f). 
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a majorant of olaas L independent of r; and the results follow 
from §6.8 (6) (i).* 

(ii) Suppose oo. We can choose n so that 
But is a continuous function of r, and so>> 

for r > « — €. Hence J g/^dx > J G, which proves the theorei 

6 . 11 . General properties of the means (amtin/^ed). 
In the preceding sections we have, in the main, confined our atten- 
tion to means for which r k 0, leaving it to the reader to deduce 
the corresponding results for means of negative order from the 
formulae (6.6.4) and (6.6.6). In this section we consider the 
means more comprehensively. We write, as is natural after 
Theorems 187 and 193, 

(6.11.1) @(/) = 2Ko(/), Max/=gji+„(/), Min/=gji_„(/). 

S5lo (/) may 1>« meaningless, but only if 2)1, (/) = oo for all r > 0 and 

2)1, (/) = 0 for all »■ < 0- 

We bsgin by disposing of two exceptional oases. 

(A) If /= C then 211, = C for all r, and this is true even in the 
extreme oases (7=0 and C=(X>fi 

(B) We may have 

211,=0 (r<0), 211o meaningless, 21l,=oo (»’>0). 

These cases we dismiss. We then leave it to the reader to verify 
the truth of the assertions in (1) and (^ below, which cover all 
oases other than the exceptional cases (A) and (B). 

(1) 211,<212:«for -oo < a^oo, unless (o) 2)1,= 211, =oo (which 
can happen only if r g 0), or (6) 211,=211c=0 (which can happen 
only if a ^0). 

* Continuity for f<« can also be deduced from Theorems 111 and 197 (Bee§6*12). 
»> By § 6.3 (6) (i). 

^ Strictly, the second case is excluded by the understanding of { 6 
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(2) We denote by 2R;.+o *te limits (which always 

exist) of S0l< when <~>r from below and from above respectively. 

If r>0 then 2Jlr-o==®i;., and 801^4.0 =9Jlr except when SOlr is 
positive and finite but 9K/ = oo for ^ > r, in which case 

®^r+o=oo>5[)fi^. 

If r<0 then and 911^^0 = 9)1^ except when 9)1^ is 

positive and finite but 9)1/ =0 for t<r, in which case 

^IJtr-o^O<Wtr- 

If r=0 there are exceptional cases corresponding to each of 
those indicated above. If 9)lo is 0 or oo then either (a) 9)l«o and 
9 ) 1+0 are each equal to 3SIq, or else (6) 

9)1^ == 9)io ~ 30?+o =* 00 

or 9)i^=0, 9Jlo = 9)1+0 = 00 . 

If 9)lo is positive and finite, then each of 9Jl_o 9)l+o, if s-lso 

positive and finite, is equal to 9)lo; but 9)l_o ^ ^+o 

may be oo. 

Finally, all possibilities not explicitly excluded may actually 
occur®. 

The results may be stated more symmetrically and concisely in terms of 

£, = log50V; 

we agree that logoo=+oo and log0=— oo. We put aside the cases 
corresponding to cases (A) and (B) above, viz. 

(a) /= C (where C may be 0 or oo), when jC,.=log G for all r; 

(b) Qq meaningless, when = + oo for r > 0 and — oo for r < 0. 

196. Apart from the cases just mentioned, the set of values of r for which 

= log 2R,. is finite is either the null sel or a closed, half -closed, or open in- 
terval I or {u, v), where -oo^w^vgoo, which includes the point r = 0 (so 
that 0^ v), but is otherwise arbitrary (so that, for example, u may be 
— 00 and v be + 00 , or u and vmay both^O). 2r is co for values ofrto 
the right of I and ^00 for values to the left. 

Inside I, Sir is continuous and strictly increasing. If r tends to an end- 
point of I through values of r interior to I, then 2^ tends to a limit (finite 
or infinite) equal to its value at the end-pbint in question, 

8.12. Convexity of log SD^r^ In this section (as in Theorem 17) 
we suppose r>0. 


. See Theorem 231. 
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196. If 0<r< t<t, and ^ is finite, thm 

unless fs 0 in apart of E and /s C in the ccmplemmtary part. 

The proof is based on Theorem 188, and is the analogue of 
that of Theorem 17. For equality, 

qf'^Cqf*. 

As a corollary, we have 

197. log SR,' (/) =r log 9R, (/) is a convex function of r. 

Compare Theorem 87. The reader will find it instructive to 
deduce the continuity of 9)1, (Theorem 194) from Theorem 197. 

6.13. Minkowski’s inequality for integrals. The inequali- 
ties of the Minkowski type are derived in substantially the same 
way as in §2.11. The ordinary form of Minkowski’s inequality 
for integrals is 

198. Ifk>l then 

(6.13.1) {i{f+g+. . + l)’‘dx}V^<{if^dx)y^ + ... + qmx)^>‘, 
and if 0<k<l then 

(6.13.2) {nf+9 + ... +l)’‘dxyi’‘>lSf^dx)V>‘+ ... -I- (JZ*da:)W*, 
unless f, g, I are effectively proportional. 

The inequality (6.13.2) is still true generally when k<(i, but 
there is a second case of exception, when both sides of the inequality 
vanish. 

We deduce this from Theorem 189 much as we deduced 
Theorem 24 from Theorem 13. Since the cases of equality are 
a little puzzling, we write out the proof of (6.13.2) in detail. 

If S—f+g+ ... + l then 

(6.13.3) iS’‘dx==if8>^^dx+fgS’‘-^dx+...+!l8*-^das. 
Suppose first that 0<1;< 1. By Theorem 19, 

' 8’‘ip+g*+...+l’‘. 

Hoaoe, if ^ffdx,... are finite, is finite. Also 
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iiTitwaH 5=0 and so/, gr, ... are all null. We may therefore suppose 
^S*dx positive and finite. 

By Theorem 189, 

ifS^-^dx > (//*'<?«)«*' (/ 8’‘dx)V*^, 

TinlAaa (o)/* and 5* are effectively proportional or (6)/5*’"^=0. 
fii'nAA fc— 1<0, and 8 is finite almost everywhere, the second 
alternative can occur only if / is null, and so reduces to a case 
of the first. Hence (6.13.3) gives 

(6.13.4) J5*da: > {(;/*'da:)«* + ... + (fl^da:)"*} 
unless /, gr, ... , i are effectively proportional; and the conclusion 
follows. 

The argument goes similarly when 1:<0, provided /5*da! is 
positive and finite. K i8'‘dx-0 then, since *<0, 5 is infinite 
almost everjrwhere, which is impossible since every / is finite 
almost everywhere. If ^S^dx is infinite then (again since k<0) 
jf^dx, ... are all infinite, and both sides of (6.13.2) are zero. 
This is the second exceptional case mentioned in the enunciation, 
and occurs, for example, when 

/=gF = ... = l = 0 

in a set £ of positive measure. 

We have excluded the cases ife=l and from the state- 
ment of Theorem 198. The first is trivial and the second is 
included in Theorem 186. We leave it to the reader to state 
Theorem 198 in a form corresponding to that of Theorem 24. 
Corresponding to Theorem 27, we have 

199. If k> I then 

(6.13.6) iif+g+ ••• +iy‘dx>Spdx+ ...+iV‘da, 
and if 0<k<l then 

(6.13.6) ^{f+g+...+l)’‘dx<Sf’‘dx+...+iV‘dx, 

unless, for almost all x, all bvi one off, g , ..., I a/re^o. If cdlof 
f,g,...,lare almost always positive, then (6.13.6) is true also for 

jjj ^ Q 

Theorem 198, with fc > 1, is a special case of the first of the 
following three mote general theorems, in which the senes are 
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finite or infinite and the ranges of integration arbitarary. We 
confine ourselyes to the case A;> 1; in general, the sign of in- 
equality is reversed when k< 1. 

200. If k> liken 

(6.13.7) 

utOeea f^(x)sC^4>(x). 

201. Ifk>l^ien 

(6.13.8) [S{/A(*)dx}*]V*<J{S/„*(*)}V*<fo,, 

urdeas /„(x)sO„^(x). 

202. If Jb > 1 iJien \ 

(6.13.9) [/{//(*, y)dy}‘dx]V‘<J{//*(a:,y)dx}Vfcd2,, 

unless f{x,y) = <f>(x)>lt{y). 

In each theorem there is equality in the exceptional case. 

Consider for example Theorem 202 (the least elementary of the 
theorems). We begin by proving the theorem with ‘ ^ ’. We give 
two proo&, in the first of which we appeal to Theorem 191. In 
each proof the chain of equalities and inequalities which arises 
is to be interpreted in the sense ‘if the right-hand side of any 
equality or inequality is finite, then so is the left-hand side, and 
the two are related as stated’. The inversions of the order of 
integration are Justified by Fubini’s Theorem. 

We write J=J(x) = J/(x, y)dy. 

(i) In order that 

(6.13.10) 

it is, by Theorem 191, necessary and sufficient that 

(6.13.11) IJgdx^M 
for all g for which 

(6.13.12) ig^dx^l. 

Now 


(6.13.13) /t/flrdx=/p(x)dx//(x,y)dy 

=]dy(ig{x)f{x,y)dx)^idy{!f’‘(x,y)dx)^, 
by Theorem 189 and (6.13.12). Hence we may take 
Jf«/dy(;/*dx)W' 

in (6.13.10), which proves the theorem (with ‘ ^ ’). 
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(ii) If / = 0, then J * 0 for almost all x, and so (for almost 

all x ) /» 0 for almost all y. Hence, after § 6.3 {d),f(x, y) m 0. 

We may therefore suppose that > 0. Let us assume for 
a moment that is finite. Then 

JJ*d*=JJ*'-id®J/dy=;dy/J*-Vd®gJdy{g/*'d®)W*(JJ*d®)W^} 
= (JJ*d®)i/*';(//*d®)V*dy, 

and so 

(6.13.14) (JJ‘d®)i^‘'g/(//*d®)»^*'dy, 

which is (6.13.9), with ‘ ^ ’ for ‘ < ’. 

In this proof we have assumed JJ^d® finite, an assumption 
which was not required in proof (i). In order to get rid of the 
assumption, we must approximate to/ by some functionforwhich 
the assumption is certainly justified. Suppose for example that 
the integrations are over finite intervals or sets of finite measure, 
that (/)„ is defined as in § 6.1, and that 

Jn=S{f)ndy- 
Then / J^^d® is certainly finite, and so 

(J J/d®)V* (/)„*d®}V^dy ^/(J/‘d®)V«=dy. 

From this (6.13.9) follows, with ‘ by making n-*co. 

The arguments under (i) and (ii) are essentially of the same 
character, the part of the arbitrary g in (i) being played in (ii) by 
the definite function 

®'~(JJ*'d®)^*^’ 

which satisfies (6.13.12) »//J*d® is finite. By using this particular 
g, we avoid an appeal to a rather sophisticated general theorem, 
but at the cost of some additional complications. A similar alter- 
native presents itself whenever we make use of Theorem 191 . 

It remains to discuss the possibility of equality in (6.13.9). 
There will be inequality if* 

^Jgdx<M 

for all g subject to (6.13.12). There is inequality in 
/dy (Jj/d®) Sidy{(^f'‘dx)Vf‘(ig^dxn 

• See the last remark of § 6.9. 
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unless, for almost all y, /* and are efifeotively proportional, i.e. 
unless (for almost all y) 

(6. 13. 16) p(y) /* (*, y) = IT (y) g*' {*), 

where p*+<T®>0, for almost all x. If p(y) were zero, for a y for 
which (6.13.15) holds, g (x) would be null, which is false. ^Hence, 
in (6.13.15), p (y) > 0, and so 

where t/i= This equation holds, for almosi all y, 

for almost all x, and therefore, by § 6.3 (d), for almost aU x^ y. 

The proofs of Theorems 200 and 201 follow similar lines. Thus, in 
proving Theorem 201, we write 

Jn=Sfndx 

and argue as follows. In order that it is necessary and suffi- 
cient, by Theorem 15,* that J„<M whenever 1. Also 

and so on. The summation -under the integral sign is justified by (ii) of 
§6.3 (t). 

The analogue of Theorem 26 is 
208. If0<r<8then 

y) < SUV' aW,<»>/(®, y), 

unless f(x, y) = 4> (x) 0 (y). 

For an explicit proof see Jessen (1). 

6.14. Mean values depending on an arbitrary function. 

There is a theory of integral mean values involving an arbitrary 
function similar to that developed in Ch. III. We do not set it 
out in detail here because it would be so largely a repetition, in 
a dightly different form, of what we have said already. We 
confine ourselves to proving the analogue of Theorem 95.'> 

204. Suppose that a ^/(x) < where a and p may be finite or 
infinite, and ffiatf{x) is almost always different from a and j3; that 
the range of integration and the weight function p{x) satisfy the 

* Extended to infinite series. 

^ A number of other analogues of theorems of Ch. Ill are stated among the 
miscellaneous theorems at the end of this chapter. A fuller treatment of some of 
them will be found m Jessen’s papers 2 and 3. A good deal of the content of these 
papers has been incorporated, with the appropriate modifications, into Ch. IIL 



INTEOBAIiS 161 

conditions of %Q. 6’, and that {t) is positive and finite for <t<t<B. 

Then 

(6.14.1) 

^yipda) Ipdx 

whenever the right-hand side exists and is finite’, and there is egucdity 
only when fs C. 

It is possible that J'/pda;=oo orffpdx- -oo; (6.14.1) is then 
stai true if interpreted in the obvious manner. It is not possible 
(when the right-hand side is finite) that J/pd® should not exist, 
i.6. //■'■pd®=oo and J/~pd®= — oo. For in this case a=! — oo, 
fi—co, and ^(/), being convex and not constant, must tend to 
infinity, with rapidity at least that of a multiple of ( / 1 , either for 
large positive or for large negative values of/,® so that i<f>{f)pdx 
cannot exist and be finite. 

We take p — q, ^qdx= 1, and suppose first that W=Sfqdx is 
finite. If / is not effectively constant, a < 9R < j8. Also / is finite, 
and a.<f<P, for almost all ®; so that, for almost all x, 

if- SR) <f>' (9R) -h i if- mM’ (p), 
where p lies between / and 9R, so that a.<ii<fi. Hence 

which is (6. 14. 1 ) . There is equality only if (/— 2Jl)® ^" (/*) s 0 ; but 
a< jLK jS, and so ^"(/i)>0, for almost all x, so thatthen/sSDl. 
Next suppose (say) //gd® = oo, so that ^ = oo. Then 

<t>{f if)nidx} gX^{(/)„}g'd®, 

by what has been proved already. Since (f) (/) is continuous and 
monotonio for large/, the integral on the right tends to (/) qdx^ 
while that on the left tends to ^(co). Hence ^(oo) is finite, in 
which case ^ is decreasing and 

<^(oo)<^{/). 

It follows that 

^ (oo) = ^ (oo) J if) qdx, 

with equality only if <f> (/) s ^ (oo), a possibility which we excluded. 
The case in which jfqdx= — oo may be discussed similarly . 

ft See Theorem 126. 
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It is possible that the left-hand side of (6.14.1) should be — oo. 
The reader mil find it instructive to verify that the various oases 
which we have contemplated can all occur. 

If we take 0 (i) s= — log t, we obtain 


exp ^ J qlogfdx^ ^jqfdx, 


i.e. ® (/) ^ ^ (/) (Theorem 184). If we take ^ == we are lekagain 
to Holder’s inequality, and other examples may be consi^cted 
analogous to those of § 3. 1 1 . If we take = t log t, we find 


205. 


ipdx 


<exp 


( iPf^ogfdx \ 

\ iPfdx )’ 


unless f= C. 

We can extend the result of Theorem 204 (except for the 
specification of the cases of equality) to any convex and con- 
tinuous <f>. 


206. The inequality (6. 14. 1 ) ia true whenever <f> (t) is convex and 
continuous in a,<t<p. 

After § 3.19, we have 

^(/)s^(®i)+A(/-m 

where A is any number between the left and right hand deriva- 
tives of for Hence 

i4»(f)qdx^^{m), 

which is (6.14.1). 


STIBI.TJES INTEOBALS 

6.15. The definition of the Stieltjes integral. We have so 
far considered series and integrals separately, and all the funda- 
mental theorems have appeared ip dual form; thus Holder’s 
inequality ^ contained in Theorems 13 and 189. It is natural to 
look for an extension of these theorems which combines them 
into one, and we can find such an extension by using Stieltjes 
integrals. 

Suppose that 4> (x) increases (in the wide sense) ina^x^h, and 



INTBOEALS 163 

that f^(a ) « a, <^{b) = p. We suppose oc and p (but not necessarily 
a and 6) finite^. The curve 


y-=y{x)^<l>{x) 

is a rising curve which may have an enumerable set of ordinary 
discontinuities or of stretches of invariability. The inverse 


function 


x^x{y)^x(<j>) 


is defined uniquely except (a) in intervals y^ of y corre- 
sponding to discontinuities x=^^oi<j> and (6) for values of y which 
correspond to stretches of invariability of (f>. If we agree that 
iVt* 1 / 2 ) stretch of invariability of x(y), in which it has the 
value then x(y) is defined except for the values (6), and is an 
increasing function of y for the values of y for which it is defined. 
Finally we complete the definition of x{y\ as an increasing 
function of y, by assigning to it, for a value (6) of y, any one of 
the values of a? in the stretch of invariability. These values of y 
are enumerable, and our choice of x{y) for any of them has no 
effect on the definitions which follow. 

We now define the Stieltjes, or Lebesgue-Stieltjes, integral 

f f{x)d<l>{x)=s f f(x)d(f>i 
J z^a J a 

offix) with respect to ^(a;), by 

(6.16.1) 

whenever the integral on the right-hand side exists as a Lebesgue 
integral^. 

The definition (6.16.1), due to Radon (1), reduces the theory 
of Stieltjes integrals to that of Lebesgue integrals, and we may 
therefore expect that no new difficulties will arise. For full dis- 
cussions of this and older definitions of the Stieltjes integral, we 
may refer to Hobson (1), Lebesgue (1), Pollard (1), Young (7). 


» If, e.g., & = oo, then )5= Um (a?). 

If e is any function of^bS^ded variation, then where ^ ^ 

inoreas^g functions, and we may define the Stieltjes integral off with respect to g by 

We shall not require thi#more general definition here. 
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We can define 

J E 

where ^ is an increasing function, and ^ a set of values of x, 
similarly, that is to say by the equation 


where € is the set of values of <f> corresponding to E 
assume € measurable. The integral 



must 


f # 

J E 

is variation of ^ in E. 


6 . 16 . Special cases of the Stieltjes integral. The simplest 
cases are the following: 

(а) <j>=‘X. In this case the Stieltjes integral reduces to the 
ordinary Lebesgue integral. 

(б) ^iaan integral. In this case 

J a J a 

(c) ^ ia a finite increasing step-function. 

Suppose that a = Oi < < . . . < a„ = 6, that <f> {x) — , where 

**:<*fc+i» ™ when l<ifc<n, has 

any value consistent with the fact that <f> increases. Then x{y) is 
a step-function with values Oj, a ^, ..., a„, and 

( 6 . 16 . 1 ) Cfd<f>= r/{*(^)}# 

J a J a 

•= (otj— a)/(ffli) + («2~ *i)/(®2) :•■••• + 

+ (^~«n-l)/(®n) 

=‘^Pkfi°’k)> 

where is the saltus of ^ at « = % . It is plain that any finite sum 
can be expressed as a Stieltjes integral; thus 

where ^ is a step-function with unit jumps at %, a^, ..., a„, and 
»**/(«*)• 
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(d) These considerations extend at once to step-fonotions with 
infinitely many discontinnities, when the Stieltjes integral is 
siuumed over all the discontinuities. Any convergent 
infinite series may be expressed in this way as a Stieltjes integral. 

6.17. Extensions of earlier theorems. It will now be clear 
that all our fundamental theorems may be extended at once to 
Stieltjes integrals, and that the theorems thus obtained include 
those for Lebesgue integrals and also those for sums. We state 
the most representative of these theorems in the next section. 
Two preliminary remarks will be useful. 

(1) When the Stieltjes integral is written as a Lebesgue 
integral, the variable of integration is Our conditions for 
equality were always of the type f=9 except in a set of 
measure zero. The exceptional set in our new theorems will be of 
measure z&ro in, (f>, and when we state this concept again in terms 
of X it becomes ‘a set of values of a; in which the variation of ^ is 
zero’, i.e. a set E such that the corresponding values of <f> form 
a null set. Our conditions for equality must therefore all be in- 
terpreted in this sense. Thus is effectively proportional to g’ 
means that 

where A and B are constants, not both zero, except at the points 
of a set over which the variation of <{> is zero. It will be observed 
that such an exceptional set caimot include any point at which 
0 (a;) is discontinuous. 

A oirnilnr point occuTS in the definition of Max/ and J^/. 
Thus Max/ is the greatest number $ such thM, for every positxve e, 
f>^—eina set in which the variation of is positive. 

(2) Many inequalities ‘ X < 7’ are true for Lebesgue integr^ 
when their analogues for Stieltjes integrals are true o y wit 
‘ g ’. Suppose, for example, that the integrations are over (0, oo) 
and that J/d®- 1. Then, by Theorem 181, 

(6.17.1) (Jr^da:)®<J/da:Ja?/da:=Jai®/da!, 
unless a^/sC/ or a^sC, which is untrue, so that (6.17.1) is true 
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in any oase. In the ooxresponding theotem for Stieltjea integrals 
we have 1 and 

(6.17.2) 

There is equality in (6.17.2) if a? = (7, i.e. if * is constant except in 
a set over which the variation of ^ is zero, or, what is the same 
thing, if ^ varies at one point only. Thus if O for 0 $ a; < 1, 
and ^(a;)a 1 for a;^ 1, then 

(Ja:d^)»=l = ;«*d^. 

6.18. The means 3Rr(/; ^)* We write 

Sl(/;^i)=2Rx(/;^), 

©(/; ^) = exp(te^)»a)lo(/;^). 

These definitions presuppose that the integrals involved are 
finite. If = 00 , we agree (following the conventions of § 6.6) 

that 21l,=oo when r>0 and 9R,=0 when r<0. The points dis- 
cussed in §§ 5.2 and 6.7 naturally recur here in connection with 
the definition of &. 

The theorems corresponding to Theorems 183 , 184 , 187 , 189 , 
192 , 193 , 197 , and 198 are as follows : we suppose for simplicity 
of statement that r > 0. 

207. Min/<SDl,(/)<Max/tt»ie««/s(7. 

208. @ (/) < in ‘particular ® (/) < %(/), unless/a C. 

209. If 'St,{f)i8 finite for some r, then 9K,(/)->@(/) when 
r— ^ -1- 0. 

210. If k>l, then 

J < (J«*d^)i'* (Jt^d^)^ 

mleaa and are effectively proportional. The inequalUy ia 
re/oereed when 0<1;<1 or 1;<0, eaxepl when u* and rf^ are 
effecHvdy proportional, or the left-hand side is zero {in which case 
the right-hand side is also zero). 
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This is Hdlder s inoiiiiality ; there axe naturally corresponding 
generalisations of Theorems 11 (or 10) and 188. 

211 . Ifr<3,thmmf{f)<m,{f),u‘nle88fsC. 

212. If 3!(tr(f) is finite for every positive r, then 3)^(/)-»-Maj:/ 
when f->+oo. 

218. log SD?,’’ (f) is a convex function of r , 

214. If k>l, then 

unless u and v are effectively proportioned. The inequality is in 
general reversed if Q<k<l or 

AXIOMATIC TREATMENT OP MEAN VALUES 

6.19. Distribution functions. In Gh. Ill we defined the 
mean value aJi^=3R^(a,g)=^-i{Sg^(a)} 

directly, and developed its characteristic properties from the 
definition. Here we reverse the process and give the ‘axiomatic^ 
treatment promised on p. 66. It is convenient to use the notation 
of Stieltjes integration, and it is for this reason that we have 
reserved the discussion imtil now; but the Stieltjes integrals 
which we use are actually all finite sums. 

In what follows we consider a special class of step-functions, 
defined for all real a;, which we qbSl finite distribution functions. 
We call F{x) a finite distribution function if 

(i) it is constant in stretches and has only a finite number of 

discontinuities, 

(ii) it increases (in the wide sense) from 0 to 1, so that 

J?»(-oo) = 0, i^(cx))=l, 

(iii) i?'(a?)=:i{J^(a;-0) + J?’(a;-h0)}for all®. 

The distribution function which has jumps q at the points a 
provides a representation of both the values a and the weights q 
involved in The simplest such function is 

JP(a;) = i(l + sgna;), 

^ We lei^ve the speoifioation of the exceptiooel oiises to the reeder. 
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which haa the smgle jump 1 at 0. If we write 

then 

(6.19.1) F{x) = -ZqE^{x), 
where 

I 

a=a,, g=gy (v= 1,2, 2?,= !, ai< 02 < ... <la„, 

is the general finite distribution function with jumps g at the 
points a. Also \ 

(6.19.2) J <f>{x)dF{x) = 'Zg^{a), 
and the mean value (3.1.3) may be written as 

(6.19.3) = <f>{x)dFix)y 

Any finite distribution function is 0 for a: < A and 1 for a; > B, 
A and B being finite niunbers depending on F. In what follows 
we confine our attention to a sub-class of these functions, viz. 
those which satisfy 

(6.19.4) F{x) = Q (a:<A), if’(a:)=l {x> B) 

for a fixed A and B. In these circumstances we say that F belongs 
to %(A,B). 

If ^ (a;) is continuous and strictly monotonic in the closed 
interval (A,B), then 2K^[.y] is defined, by (6.19.3), for all Jf* of 
® (A, JB). The values of <f) {x) outside (A, B) are not really involved 
in (6.19.3), and we may choose them as we please; it is natural 
to choose them so that (f> (x) is continuous and strictly monotonic 
for — ooga:^oo. 

6.20. Characterisation of mean values. Our object is to 
prove the following theorem. 

216. Suppose tAca there is a unique real •number corre- 

sptmding to each F of ^ (A, B), toilh the following properties : 

[ 1 ] {AS^^By, 

[2] if Fi and F^ belong to 2) (A, B), F^^F^ for all x, and 
F-t > F.for some x, then 
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[3] if F, F*, 0 bdomg to % {A, B), and 
mF]=m[F*i 

then mitF+(l-t)G]=m[tF* + {l-t)Cf\ 

for 0<t< 1. 

Then there ie a function ^{x), continuous and strictly inoreasvnq 
in the dosed interval {A, B), for which 

( 6 . 20 . 1 ) = = 4>{x)dF(x)y 

Conversely, »/ 2Jl[J'] is defined by (6.20.1), for a <f>{x) wiOi the 
properties staled, then it satisfies [1], [2], and [3], so that Oiese 
conditions are necessary and sufficient for the representation of 
9)1 [J’J in the form (6.20. 1).» 

We begin by proving the converse half of the theorem. If 9)1 [J’] 
is defined by (6.20.1), then it is obvious that it possesses property 
[1], and all but obvious that it possesses property [3], since 
^ (9)1 [t + (1 - i) (?]) = ti<l>dF+(l- t)if>dO 

= tf<l,dF * + (1 - t)fif>dO = ^ (9)1 [il** + (1 - 0 (?]). 

It remains to prove [2]. 

Suppose that F^ and F^ satisfy the conditions stated. Then 
there is a positive number p and an interval (a, ]8) such that 

^ 1 (®) ^ (®) ^z (®) 

in (a,)3).*> Hence 

f" ^^ f ^- f” 

J —00 J — oo 

= f” (Fi-Ft)d<l>^ 

J —00 

^j\F^-Ff)df,^p{HP)-<l>(a)}>0. 

* See Nagnmo ( 1 ), Kolmogoroff ( 1 ), de Einetti ( 1 ). We follow the lines of de 
iPinetti’s proof. 

^ There is an a;o for which 

i -0) + l ’ t (* b +0)}> i {^'.(*0 -0) +^’,(*6+0)}. 

Hence either ri(®o-0)>F,{a;,-0) orf,(xo+0)>i’,(a!o+0). In the first case there 
is an interval sati^ying the conditions to the left of x^, in the second case one to 
the right. . , 

® If we remember our understanding* at the end of § 6.19, about the dehmtion 
of ^(a;) outside (A, B). 
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6.21. Remarks on the characteristic properties. We 
have stiill to show that the properties [IH^] are sufficient to 
characterise the means . We insert first some general remarks 
concerning the ‘significance’ of the properties. 

(i) [1] asserts that ‘if all the elements of a set have tl^ same 
value, then their mean has that value ’ . 

(ii) [2] asserts that ‘9)1 [JT] is a strictly monotonic functional 

of J* It would not be sufficient to assert that (under the condi- 
tions stated) 9)1 [I'll ^ 9)1 i.e. that ‘9)1 [J’] is a monotonic 

functional’. 

Let 118 consider some examples. 

(а) The arithmetic mean 

21 (a, q) = Lga = JxdF = 21 [.F] 

is a strictly monotonic functional of F. In this case ^(a?) =a?. 

(б) We may define ‘Max a* as ‘the lower boimd of the values of, a for 
which jF'(a?)=: r (F being any finite distribution fimction with jumps at 
the points a). Then Max a=sij.[FJ is a functional of F which is plainly 
monotonic: if Fi^F^ for all x, then 

strictly monotonic : if Fi and JP, are defined by 

Fi=:Fg=0 (x<0); Fi = i,F^ = 0 (0<a;<l); F^^F^^^l (x>l), 

then fi[Fi]=:MeLx(0, l) = Max(l, l) = fi[F 2 ]- 

That /i[F] is not representable in the form (6.20.1) follows from the 
theorem itself; if it were, it would be strictly monotonic. 

(c) The geometric mean C5 = C5 (a, q) is a functional of F which is not 
strictly monotonic, since, for example, the sets (0, a^, ...) and (0, b^, ...) 
have the same (B. It is representable by the formula 

(5 = exp^y log xdF(x)j, 

This is of the form (6.20.1), with ^ (a;) = log » fora; >0; but (5 is not repre- 
sented in the manner prescribed by the theorem, since loga; — oo when 
aj-> 0. 

(iii) If we use [3] twice, the second time with F*, O, ©♦, 1 - ^ 
in place of O, F, F*, t, we see that 

(6.21.1) m[tF + (l^t)G]^m [tF^ + (1 ~ 0 
whenever Wl[F]^Wt[F*] and 9K[G]«9R[G*]. In other words 
(a) [tF ^{l — t)0]i8 determined uniquely by SK [J^], 3Jl {G] and t. 
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Moie generally 
( 6 . 21 . 2 ) = 
if SW[l'J=a)l[J’/] and Sg,= 1. 

A functional gf is said to be linear if 

pJ’ +«(?]= [f] + [t?] : 

in this case it has certainly property (o). If 2R[J’] satisfies (o) 
or [3], of which (o) is a consequence, we may call 9R[J] qmsi- 
linear. If also we agree to describe the property [1], as is natural, 
by consistency, we may state Theorem 215 shortly as follows: 
the most general consistent, strictly increasing, and guasi-linear 
functional of F is that defined by (6.20.1). 

6.22. Completion of the proof of Theorem 215. The 
functions {x). Eg (x), and (1 — t) E^ {x) + tEB (»), whereO < i < 1, 
belong to ® (A,B).“ We write 

{t) = — t) Ej^ + tEg], 

so that 0(O) = 3R[.B^]=A, ^(1) = 5IR[.®£] = 5. 

Let us assume provisionally that ^ {t) is strictly increasing and 
continuous. Then ^(t) has an inverse 

<l>{u) = tp-^{u) 

which is also continuous and increases strictly from 0 to 1 when 
u increases from A to £. If 

«=«/>(«), t = (l>{u), 

then [.E J = « = ^ (<) = 2R [( 1 - ^ («)) + («) Eg]. 

Hence, using [3] in the extended form (6.21.2), and the expression 
(6.19.1) for any finite distribution function F, we obtain 
aR[J’] = 9K[SjjE„] 

= [S 2{(1 - <!> (a)) +^(a) Eb}] 

= 3K [(1 — (a)) + (2qf>{a)) Eg] 

= ^(SS^(o)) = ^-MSg^(a)). 
the result of the theorem. 

It should b© observed that here ^(.4) = 0, ^(B)= 1. When a ^ has been 
found, it may (after Theorem 83) be replaced by any ot^+ 

* and Eg are extreme oases of functions of B)i if E belongs to £(A,B), 

then Bji itE iSa for all x. 


HI 
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It has still to be proved that ^(() is strictly increasing and 
continuous. 


(1) If 

then {l — ti)E^ + tiE£^{l — t^)Ej^ + t^EB 

for all X, with inequality for some x. Hence, by [2], / 

(^i) ~ [{ 1 — ^i) E^ + Eb^ < so? [( 1 — i2) E_^ + ^2 EbI — w 

(2) Suppose, if possible, that ^(i) has a discontinuitjnon the 

right at , where 0 ^ f - Then we can find a ^ such that 

for arbitrarily small e, and ' 


for all X, with inequality for some x. Hence, by [2], 

( 6 . 22 . 1 ) 

for any t of (0, 1). But if s and t he in (0, 1), then, by [1], 


and similarly for t\ and, by [3], 

(1 — 8)Ej^ + sEb {l — t)Eji+tEB~\ 

2 2 J 


r ■ pi 

^ + ~£!b 




Combining this with (6.22.1), we see that 

are separated by a number, viz. SD?[i.2rf+i^^(()], which is in- 
dependent of e; and so, making e->0. 


Uo). 


Hence tfi has a discontinuity at |(^o+0> all < of an interval; 
and this is impossible, because the discontinuities of a monotonic 
function are at most enumerable. 

It follows that ^ (0 has no right-hand discontinuity. Similarly, 
it has no left-hand discontinuity. It is therefore continuous, and 
thia completes the proof. 
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Wo have ooiifined our attention to finite distribution funotionB, bo that 
all the functions which have been considered are step-functions, and the 
means are the means of Ch. III. There is a similar theorem in which both 
hypothesis and conclusion are stronger in that they apply to a class of 
functions more extensive than X) (A, B). Let us denote by !)♦ (A, B) the 
class of functions which have the properties (ii) and (iii) of § 6, 19 and also 
satisfy (6. 19.4). We can then prove a theorem which differs from Theorem 
215 only in the substitution of for X). The proof is very much the same, 
but is slightly more elaborate in its final stages. See de Finetti (1). 

MISCELLANEOUS THEOREMS AND EXAMPLES 

216. ‘Velocity averaged by time is less than velocity averaged by 
distance.’ 

j^This ^ ^ /(s) ^ case of Theorem 181. J 

217. If the kinetic energy of a mass M of moving homogeneous in- 
compressible fiuid is Et and the average velocity of its particles is F, then 
E>\MV^, unless all particles have the same velocity. 

[If p is the density, v the velocity of an element dS^dxdydz, then 
M:=:pSdS, VSdS=^S^dS, E=:ipiv^dS, 
and the result follows from Theorem 181 (for triple integrals).] 

218. A unit electric current passes through a closed plame circuit 
enclosing an area A, and exerts a force F on a unit magnetic pole P in 
the plane of emd interior to the circuit. Then 

2AF*>(27r)» 

unless the circuit is a circle whose centre is P, 

[Suppose, for simplicity, that the circuit is ‘star shaped’ with respect 
to P (i.e. that every point of the line from P to any point of the circuit 
lies inside the circuit). Then, using polar coordinates r, 6 about P , and 
integrals from 0 to 27r, 

2„=jd0= j g)* (1^)* (J 

unless r is constant.] 

219. If/- (*. y) and (x, y) are two (finite or infinite) seta of functions 
of » and y, then (s/J/gr<fa:dy)»<SJ//*dirdySJJfl''d»dy, 

unless there are two constants a and b, not both zero, such that 
of,(x,y)^bg,{x,y), 

for every v. 

[From Theorems 7 (for infini te series) and 181 (for double integrals), 
or directly, by the second method of § 2.4. The theorem illustrates the 
following principle. The inequality 

(i) (2:ssMi>)*gsss«*rss»*. 
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where u and v are functions of three integral variables m, n and p, does 
not differ materially from the ordinary form of Cauchy’s ineqtiality ; but 
we can derive materially different inequalities from (i) by replacing dif- 
ferent selections of the signs of summation by signs of integration.] 

220. Suppose that the a eace positive, and that qr is defined by 

Then (»-=l, 2, ...) 

unless all the a are equal. 

221 . 

unless all the a are equal. 

[Theorems 220 and 221 were communicated to us by Prof. I. Schur. 
The q are means of homogeneous products of the a, like the p of § 2.22, 
but now the a in a product are not necessarily different. In particular 

Theorem 221 follows from Theorem 220 as Theorem 52 followed from 
Theorem 51. To prove Theorem 220 we observe that 

where a5„s= 1 — as,— ... — and the domain of integration is defined 
by a?i>0, ..., We obtain Theorem 220 by applying 

Theorem 181 (for multiple integrals) to (i). 

The formula (i) leads to a more complete theorem. If the a are real 
(but not necessarily positive) then the quadratic form Y^q^^^y^y^ is strictly 
positive; and if the a are positive, then the form Sgr+rfi^ry# is strictly 
positive; except (in both cases) when all the a are equal.] 

222. Kp>l,/i8lf^in(0,a),and 

/*/(<) *. 

then F{x)^o(x^^^) 

for smaU x. 

[By Theorem 189, 

F” ^ jy^dt (ij I’/Pdi, 

and the second factor tends to 0.] 

228. If p> 1 and / is in (0, oo), then F(x)szo(x^l^) both for small 
and for large x. 

[For small x, by Theorem 222. To prove the result for large x, choose X 
so that 

j'^fPdx<€» 
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and 8UX>po8e x>X. Then 

(F(x)- F(X))’‘=(^j ^fdlY 

F{x)<F {X) + tx^lf < iex^h' 
for sufiEioiently large x.} 

224. If ^ is an ifitegral except perhaps at 2 ; = 0 and xy'^ is integrable in 
(0, a)f then 

y-o 

for small x. 

225. If ^ is an integral except perhaps at 0 and 1, and a; (1 — a;) y'* is 
integrable in (0, 1), then y is and 

0 ^ j ^y*dx---(^j\dxy ^ij^^x{l-x)y'*dx. 

[That yiaL* follows from Theorem 224. The first inequality is included 
in Theorem 181 . For the second, we have 



sziJ^J\y(u)~-y(v)}^dudv 

f duf dv( f y'{t)d^ ^ f^dui^ (v-’U)dv{^ (y'(t)fdt 
Jo J u \J U / J 0 J u J u 

= fl (y'W)*dt ^‘du p^(v-u)dv = il\l-t)y'*dt. 


Of the two inequalities, the first can reduce to an equality only if y is 
constant, the second only if y is linear.] 


226. Ifm>l,n>—1, and / is positive and an integral, then 

/ ® vn t / /*« \ 1 

^ 1 / * f”'dxj "* |/'|"•c^a;j«, 

with equality only when /= B exp { — where B g 0, 0 > 0. 

In particular 

(ii) /o"-^**** ^ ^ ( C 

unless /s and this inequality holds whether / be positive or not, 

and also for the range ( — oo, oo). 

[The baost interesting case is (ii), which is due to Weyl (1, 346), and 
is useful in quantum*mechcuuos. 

Assume that the integrals on the right-hand side of (i) are finite. Since 
/iacontinuouB,andn<w(«+ l)/(»n- l).that on the left-hand ride is atoo 
finite. Hence 

lim ;*;"+^/"‘ = 0, 
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and so, integrating by parts over (0, **), where (*») is an appropriate 
sequence which tends to infinity with k. 

Urn 

Jo »+ljfc-^soyo 


But, by Theorem 189, 

nwlA«B /'<0 and /' and effectively proportional. This 

hypothesis leads to the form of/ stated.] 


227. If 6 increases, 

(ifgd<l>?<iPd<l>Sg^d4>. 

unless /and g are effectively proportional (in the sense of § 6.17). 

[Included in Theorem 210; wanted in Theorem 228.] 

228. If a ^ 0, 6 ^ 0, a + 6. and ^ is non-negative and decreasing, then 

unless C, where C> 0, m (0, |), and 0 in (f, co). 

K may be 0. The inequaUty is stronger than that resulting from a direct 
application of Theorem 181. It foUows from Theorem 227 ifwe reduce the 
integrals to the form considered there by partial intepation. The case 
asO, 6=2 was mentioned by Gauss in connection with the Theory of 
Errors: see Gauss (1, iv, 12) and P61ya and Szego (1. n, 114, 318).] 

220. IfogO, 6g0, o#l, and if> is non-negative and increasing, then 

(i - /;*“**/.'*»•**. 

unless ^=(7. 

[See P61ya and Szego (1, i, 67, 214). In this case Theorem 181 givesa 
rovetsed inequality, with the factor 1 on the right-hand side.] 

280. If Q<a<f^A<tx>, 0<6^ggB<oo, 

then I /‘d® | ("^) + sj (|b)} ' 

[Analogue of Theorem 71: see P61ya and Szeg5 (1, i, 67, 214).] 

281. If we consider the closed or open intervals (in general four in 
numhw) with end-points -o, 6, where a^O, 6^0, and suppose each of 
o and 6 zero, pomtive and finite, or infinite, we obtain in all 34 types of 
intervals I. Assign to each interval I a funotion/(») defined for 0<*< 1 
and such that logSHrC/), where 9W,(/) « formed for the interval (0, 1) and 
with g= 1, is finite just for the values of r in J. 
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[Examples: 

I is - a <r<bi f(x ) = ( 1 - ^log 

lis -oogrgoo; f(x)=l+x*-. * 

/ is the single i)oint 0; /(*) = exp ( - a;-* + ( 1 - *)-*); 

I is empty; f(x) = exp ( - a:-i + ( 1 - »)->). 

This contains part of the proof of what is stated near the end of § 6.11.] 

282» Oeometricai interprelation of Minkowski’s inequality. Suppose 
that a point in functional space is defined as a function of i*, two functions 
defining the same point if and only if their difference is null; and that 
the distance between two points / and g is defined by 

W,9)=V{l(f-9Ydx}. 

Then (i) the distance between two distinct points is positive; and (ii) 

8(/.'‘)g8(/.sr) + 8(fif,A). 

[If wo define distance by 

Hf,9) = (l\S-9VdxYl' (r&l). 
w© obtain similar results in ‘functional space jL*”.] 

233 . The shortest distance between two given points in Euclidean space 
is the straight line. 

[A curve in space is given by 

x=^x(t), y=y(t), z=:z{t). 

We may suppose that t increases from 0 to 1 on the arc in question. If we 
assume that x, y, z are integrals of functions of L®, then the length I is 
given by 

1* = [/(*'» + y'> + z'»)*dtf = SUlj (»'» + + «'“) S 9Jlj {x”‘) + (j/'*) + W^{z'•), 

by Theorem 198; and this is not less than 

iix'dt)* + (f y'dt)» + ifz'dt)^ = (*1 - XoY + (yi - VoY + (*1 - *o)* • 

If there is equality. Ax' = By' = Oz', and the curve is a straight line.] 


234. If0<p<land 


jfgdx^A{Sg’’'dx)^l’^ 


for all g, then If^dx ^ A 

[Compare Theorem 70. If/>0 for all r, define flf by /?=/*•. If/> 0in 
E, /=0 in CE, and the measure of CE is finite, define g by/fif=/‘’ in E 
and by ^ = G in CE, and proceed as in the proof of Theorem 70. If the 
measure of CE is infinite, take (for example) 


g=Ge-^ 

in OE, Then 

jfpdx= jfgdx^A [j j^Pdx+a»' j ^^eP'->dx'j ' , 
and the result again follows when Q oo.] 
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285* Suppose that/ and p are positive and that / has the period 2n; that 

/ 2ir / fiir 

^ f{x + t)p{t)dtl p(t)dt; 

and that the means 90^ refer to the interval (0, 2fr) and a constant weight- 
function. Then 


(0<r<l), mr(F)^mr(f) {r^iy 

[This may be deduced from Theorem 204, or proved directly (suppoajng 
for example 1) as follows: 


{lp(t)dtY 

For the case r = 0, see P61ya and Szego (1, i, 66, 212).] 


286. We say that /(a?, y, . . .) and (a?, y, . . . ) are aimUarly ordered if 

oppoaUdy ordered if f exid — gr are similarly ordered. Prove that 
/J. . ./da;dy . . . J/. . . grda;dy . . . ^/J. . . cfedy . . . JJ. . ./firctedy . . . , 

if / and g are similarly ordered, while the sign is reversed if / and g are 
oppositely ordered. The integration is extended over any common part of 
the regions of definition of / and g, 

[Analogue of Theorem 43 (with r = 1), due in substance to Tchebychef 
(who considers only monotonic fimctions of one variable).] 


287. If ^ and ^ satisfy the conditions of Theorem 156, and 

«(*)= jy(t)dt. ’F(*)= jy^dt, 

then ifgdx ^ J O (/) da? + J* T {g) da?. 

288. Iff and g are positive, and k a positive constant^ and/log+/ and 
are integrable, then fg is integrable. 

[By Theorem 63, k/g^flog+f+e^^"-^.] 


289. If/ is positive, then 

fyiog^dxg2 f*/log+/da?H-'^y?. 

y 0 a? j 0 e 

[Take gs=: j^log A? = 1, in the inequality used in proving Theorem 238.] 
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240. Iff is positive and L in (0, a), and 

Fix)=jydt, 

then ( /(z)log-die=: f F(a)log-, 

Jo a? J 0 X o 

whenever either integral is finite. 

241. Suppose that a is positive and finite; that B^B{a) denotes 
generally a number depending on a only; that /(a?) ^ 0; and that 

I a faP 

/log+/da;, K= ^dx. 

Jo J 0 ^ 

Then (i) if J is finite K is also finite, and 

K<BJ+B: 


(ii) when / is a decreasing function the converse is also true; if ^ is finite 
then J 18 finite, and J<BKlog^K+B. 

[For the last two theorems see Hardy and Littlewood (8).] 


242. If / is positive and L in (0, o) and 

then g is L and j^g(x)dx=: j f(x) dx. 

[Integrate by parts; or substitute for g and change the order of Integra- 
tion.] 


243. We define where ^ is a continuous and strictly increasing 

function, by 

Then, in order that (/) S (/) 

for all/, it is necessary and sufi&cient that tp should be convex with respect 
to 

244. In order that 

for all/5=/(a?j,a:j, it is necessary and sufficient that every be 

convex with respect to the corresponding 

245. In order that 

for all/, it is necessary and sufficient that (i) g„^p,^and (ii) g/i^pv when 
/i>v and the permutation by which vj, v,, ...» derived from 1, 2, ..., n 
involves an inversion of /i and v. 
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[For the last three theorems, which correspond to Theorems 92» 93 and 
137, see Jessen (2, 3).] 

246* Holder’s inequality may be deduced from 

2^1*' aRo*(/) ^ SRiM/) 

(Theorem 203), by taking 

[See Jessen (3).] 

247. If (i) <l>{x, t) is positive, continuous, and convex in x, for Xi^x 
^>0; (ii) p(0^0; (iii) the integral \ 

/(*)= t)p{t)dt 

is finite for x=Xi and a;=a; 2 ; then I{x) is continuous and convex for 

Xi<X<X^. 

[That I(x) is bounded and convex follows immediately from the 
convexity of <f>; that it is continuous, from Theorem 111.] 

24S. If fix) and ^(a?) are positive and <l>(x) convex for positive x, and 
is finite for x—Xi and a;=a;j|, then 7(a?) is continuous and convex for 

Xi<X<X2* 

[By Theorem 119, aj^(l/a?) and 

are convex, and we can apply Theorem 247. More general results can be 
derived from Theorem 120.] 


240. In order that 


r<l>igix))dic^ 

J a J a 


should be true for every convex and continuous it is necessary and 
sufficient that rj^ 

/ g(x)dx= /(x)dx 
J a J a 


aod P (g{x)-y)+dx£j\f{x)-y)+dx 

for all y, 

pSere a*" means Max(a,0), as in §6.1.] 

250. If / and g are increasing fimctions, then an equivalent condition is 


Pg(x)dx^ j^f{x)d!S 


for 
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[For the two last theorems, which embody analogues for integrals of 
parts of Theorem 108, see Hardy, Littlewood, and P61ya (2).] 

261. If /i(0f /jCOt ••• »/i(0 real and integrable in (0, 1), then either 
(i) there is a function x{t) such that 

J'fi(t)x(t)dt>0 l'/,(t)x(t)dt>0, 

or (ii) there are non -negative numbers 2/i , ^ 2 » • • • > aU zero, such that 

2/iA(<)+2/a/2(0 + ... + 2/i/i(0 = 0. 

252. If /i(0»/a(0» ••• >/m(0 ore real and continuous in (0, 1), then either 
(i) there are real numbers Xa* — » such that 

(0 + (0 + • . . + «m/m (0 

is non-negative for all, and positive for some, t of (0, 1), or (ii) there is a 
positive and continuous function y(t) such that 

ljAt)y(t)di=o, ... ,j'fjt)y{t)dt=0. 

[Theorems 251 and 252 are both integral analogues of an important 
theorem of Stiemke (1) concerning systems of linear^ inequalities. 
Suppose that 

^\fi (^~ 1 > 2 , 2 , ..., fn) 

is a rectangular array of I rows and m columns, and that 
= + ••• 

(^) = "if* 2/i + OZM 2/a + • - + OiM 2/i ; 

and consider the two problems: 

(i) to find a real set («) for which 

Li{x)>0, L^(x)>0y.,., Li{x)>0; 

(ii) to find a non-negative and non-null set (y) for which 

= M2(2/) = 0,...,M„(2/) = 0. 

Since l!iyL(x) = Sa;M (y)f 

the two problems cannot both be soluble for the same set (a), and Stiemke’s 
theorem asserts that one is soluble whatever the set (a). 

Theorems 251 and 252 state analogues of Stiemke’s theorem in which 
the m columns or I rows are replaced by a continuous infinity of columns 
or rows. These theorems, and further references to the theory of systems 
of linear inequalities, which we have excluded from our programme only 
on account of its algebraical and geometrical preliminaries, will be found 
in Haar (1) and Dines (1).] 
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SOME APPLICATIONS OF THE 
CALCULUS OP VARIATIONS 


7.1. Some general remarks. The ‘simplest problem of^the 
Caloulus of Variations’ is that of determining a mazimmn\ or 
minimum value of \ 

rxi ' 

J{y)=\ F(x,y,y')dx 

for all functions y—y(x) for which 

(1) are given, , 

(2) y' is continuous. 

Let us denote this class of functions by Then our object is to 
find a function y_ 

of Si, Buoh that either 

J{y)<rF(x,Y,Y')dx^J{Y), 

J XO 


or J (y) > J ( Y), for all y of S' other than Y. The general theory 
tells us that, if such a function Y exists, it must satisfy ‘ Euler’s 
equation’ 


(E) 


'dy~dx\dy'j 


Let us consider some simple examples. 

(i) Suppose that «f(y)= f y'*dx 

J 0 

and yo= yi*= Then (E) is y” = 0, and the only solution satis- 
fying the concfitions is y^^x. It is easy to verify that Y =>x does 
in fact give a minimum for J (y). For J ( F) => 1 and 

1 = ( Jo y' <*») < y'*dx-J (y), 


by Theorem 181, unless y'sl,y=x; so that J > 1 for all y othmr 
than F. 
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We have in fact proved more than the problem as stated 
demands, since the proof is valid whenever y ia an integral. This 
last hypothesis is however essential, since there are Amotions y 
for which 

(7.1.1) y(0) = 0, y(l)=l, y' = 0, J{y) = 0. 

In order that y should be an integral, i.e. in order that there 
should be an integrable function f(x) such that 

y(®)= I /(«)<*«, 

J a 

it is necessary and sufficient that y{x) should be ‘absolutely 
continuous’. It is necessary, but not sufficient, that y{x) should 
have bounded variation. In particular it is not sufficient that 
y{x) should be monotonic; there are increasing functions y which 
satisfy (7.1.1). 

If y is the integral of /, then y' =/; an integral is the integral 
of its derivative. All this is expounded in detail in books on 
the theory of functions of a real variable^. The main theorem 
needed in this chapter is the theorem of integration by parts, 
stated in § 6.3(a). 

These remarks lead us to lay down the following convention. 
Throughout this chapter it will be assumed that, whenever y and 
y' occu/r in an enunciation or a proof, y ia an integral (and so the 
integral of y'). A similar assumption wiU be made about y' and y" 
(if y" occurs in the problem); and the assumption naturally 
applies also to letters other than y. Without this assumption, all 
the problems of this chapter would lose their significance. 

(ii)** Suppose that 

J^(y)= 

and y„=yi= 0. The only solution of (E) satisfying the conditions 
is y=0. If r=0, J(r) = 0, but Y does not give a maximum or 
nnininmTn of J (y). It is in fact 68^ to oonstruot a y of % for which 

• See for example de la Vall4e Pousain (2), Hobson (1), Titohmaiah (1). 

** ^Hiis the next example are due to Weierstrass and are of great historical 
importance. 
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J(y) u ae laige, positively or negatively, as we please. Thus, if 
/(a?) is any function for which /(O) =/(l) » 0 and 

J^f»dx>0, 

and t/=Cf(x), then J (y) is large when 0 is large and has( the 
sign of C. 


(iii) Suppose that 




^0 yi = 1 • Here J (y ) > 0 for all y of ^ ; but y^x”^ gives J=^m, 
so that there are y of ^ for which J (y) is as small as we please. 
For y of J (y) has an unattained lower bound 0. The same is 
true for classes of y more general than ^ (for example, the class 
of integrals). On the other hand, J (y) attains its bound 0 for the 
function mentioned imder (i) above, and also for the discon- 
tinuous function which is 0 for 0 and 1 for x > 0. 


7.2. Object of the present chapter. The Calculus of Varia- 
tions might be expected to provide a very powerful weapon for 
the proof of integral inequalities. There are however hardly any 
instances of its application to inequalities of the types important 
in general analysis. This may be explained on two grounds. In 
the first place, the Calculus of Variations is concerned avowedly 
with attained maxima or minima, while many of the most im- 
portant integral inequalities assert unattained upper or lower 
bounds. Secondly, the ‘continuity’ hypotheses of the classical 
theory are very restrictive. It is often more troublesome to 
extend an inequality, proved by variational methods for a special 
class of functions, to the most general classes for which the in- 
equality is required, than to construct a direct proof of the full 
result. For these reasons the Calculus of Variations has been 
almost ignored in this chapter of analysis. 

The ideas of the Calculus are however often very useful, and 
we apply them here to a number of special inequalities. When, as 
in example (i) above, or Theorems 254 and 256 below, the bound 
asserted by the inequality is attained, and attained by an ex- 
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tremal, that is to say by a solution of Euler’s equation, these ideas 
are obviously relevant, and the result may well be one which it 
would be difficult to obtain in any other way. We shall however 
find that they are sometimes effective even when the bound is 
unattained and the final result lies outside the scope of the theory. 

Our arguments will not demand any detailed knowledge of the 
theory; except in § 7.8, we shall require only its simplest formal 
ideas*. 

7.3. Example of an inequality corresponding to an 
unattained extremum. As a first example of the use of varia- 
tional methods, we select a special case of a theorem which was 
first proved in an entirely different manner, and to which we 
shall return in § 9.8. 

253. If y' bdonga to (0, oo), j/q = 0 , avd yis notdkoayazero, Ihm 

•^(2/) = Jo 

It is necessary for our present purpose to consider the more 
general integral 

(7.3.1) 

Euler’s equation is 

a:V'+Ay=0 (A=l//igJ). 

Its solution is y= Ax”* + Bx!^, 

where = 

if/a>4, andis y=x*(,A + Blogx) 

if ja= 4. In neither case is there a solution (other than y = 0) for 
which y' is L*. 

For this reason it is necessary to modify the problem before 
we attempt to apply variational ideas. We consider 

* Knler’i equation and Hilbert’s invariant integral AnyAi^ yMoh we assume 
will be found without difficulty in the books of Bliss (1) or Bolza (1). 
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with ^ 0 =° M There is then one* extremal satisfying 

the conditions, viz. 

(7.3.2) y= 7=®"* =**+“, 
where 

(7.3.3) 

A simple calculation gives 

(7.3.4) 

\ 

and this suggests the following theorem. 

264. •> Ifn>4:, 2 /( 0 ) = 0, y(l)= 1, andy' is L\ then 

(7.3.8) = 

where a is defined by (7.3.3). The only case of equality is that 
defined by (7.3.2). 


7.4. First proof of Theorem 254. We give two proofs of 
Theorem 254. The first demands no knowledge of the Calculus of 
Variations, though the transformations which we use are sug- 
gested by our knowledge of the form of the extremal Y. 

If 


(7.4.1) y=a;*+“+7j= r+Tj, 


then 

(7.4.2) 

where 


J{3,) = J{Y) + J{yi) + K{Y,yi), 
i[(r,r,) = 2j^'(,iFV-§)d*. 


Since Y' and so if are L*, q=o{a^) for small x;’> and so 

JS:= 2 Ito ( - - J 


» The extremal y = Aa?l+* + (1 ~ 

gives yo=0» 2/1 for any A; but y' is not Z*. and J (y) diverges, unless Asl. 

^ For this and some later theorems in this ohapter see Hardy and Littlewood (10). 
^ Theorem 222. 
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Hence (7.4.2) gives 


(7-4.3) = + /(,). 

and it is sufficient to prove that 

J {r)) > 0 . 

Here rj' is L®, and rj is not null but vanishes at the ends of the 


J{y)^ 


interval. 

We now write 
Then 




(7.4.4) = 

ri /•! y2r2 

= Y^C'^dx+ YY’a'dx. 


But 

(7.4.6) 


2 ^ 1 *^ YY' a'dx= -,i{YY'^%-J\y'^+ YT'U^dx. 

J s j s 

Combining (7.4.4) and (7.4.5), and observing that F is a solution 


^F"+|=0. 


we obtain 

(7.4.6) Jsiv)^ -,j,(YYVh+l^j\YCrdx. 

But FF'^*=(i + a)^^ = (K«)“-^0 

X X 

when a?->0. Hence, when we make 8->0 in (7.4.6), we obtain 

(7.4.7) ^{YC'f^> 

which is positive unless the integrand is null, i.e. unless ^ = 0, 
This proves the theorem. The condition /a > 4 was required to 
malrfl F' belong to Z-®. We have however reduced the theorem 
to dependence on the identity (7.4.7). Since 


HI 
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(7.4.7) and (7.4.3) give 

and here T has disappeared. Since both sides of (7.4.8) are con- 
tinuous in jjLt we may now include the case ft =4, a = 0. The 
identity may be verified directly by partial integration when its 
form has been discovered (though some care is required about 
convergence at the lower limit). \ 

We are now in a position to prove Theorem 253. If we wri^ 
x=^XIi, cy(x):==T(X), 
and then replace X, T again by x, y, we obtain 

(7.4.9) 

where now y(0) = 0, y{^)=c. If y' is in (0, c»), c=o(|*) for 
large and the first term on the right tends to 0 when ^->-ob. 
Mahing ^-»-oo, and supposing p.= 4, we obtain 

This formula, which makes Theorem 253 intuitive, is valid when- 
ever y' is L^, and may of course be verified directly**. 


7.5. Second proof of Theorem 254. In our second proof we 
make explicit the variational theory which underlies the first. 

Suppose that y = F (*), or E, is the extremal through the end- 
points Pg and Pi , and that 

(7.6.1) y=y{z,a.), 

or .^(a), is a family of extremals containing E and depending on 
a parameter a. Suppose further either that 

(i) E (oc) covers up a region surrounding P in a (1, 1) manner, so 
that just one extremal passes through every point of the region, 
and a is a one-valued function of x and y ; or that 

(ii) every curve of P(a) passes through Pg, so that y{x^,(t) is 
independent of a, but condition (i) is satisfied in all other respects. 

* Theorem 223. 

^ Grandjot (1) gives a number of somewhat similar identities for series* 
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In these ciromnstances* (7.6,1) is said to define a fidd of ex- 
tremals including E. 

The slope y'{x,a) 

of the extremal through a point P of the field may be expressed 
Bs a one-valued function 

p=^p{x,y) 

of X and y. Hilbert’s ‘invariant integral* is 
J*{C) = j^{(P -pF^) dx + 

Here F and F^ are the values of F{x,y,y') and Fj^(x,y,y') 
when y' is replaced by p, and the integral is taken along any 
curve C which lies in the region covered by the field. 

The fundamental properties of Hilbert’s integral are as follows. 

(i) tf * (0) depends only on the ends Q, J2 of (7 ; in other words 

{F-pFj,)dx-\-Fpdy=dW 
is a perfect differential, and 

J*{G)=Wn-WQ. 

(ii) If <7 is the extremal E, then 


J*{E)= f Fdx=J{E), 

J B 

say. It follows that, if C runs from P, to Pj, then 
J(0)-J{E)=J(C)-J*{E) = J{C)-J*(C) 

~ j ^F{x,y,y')dx 

- j J{F{x,y,p)-pFp{x,y,p))dx+Fj, (x, y, p)dy} 
{x,y,p,y')dx. 


where 


€ (a?, y,p, y ') = p (*, y, y') - F (®, y,p) - iy'-p) Pp (», y,p). 

* With the addition of certain conditions concerning the differentiability of 
which it is mmeoesaazy to repeat here: see Bolza (!» 0&-105). 


X3-3 
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Here y' is the slope of G at any point and p the slope of the 
extremal through the point; and S. is Weierstrass’s ‘excess- 
function*. If €> 0 whenever y' then 

J(E)<J{C) 

and E gives a true minimum of J. 

In the present case we take 

y = 

as E(<t). We find 


p = a(Ha)x-t+“=(i + a)|, 


F=(j,p^ 


J*: 




2y 


F-pF^=-- 


P (i-a)x’ ^ ^ 


where 


W = 


'(^-a)x‘ 

Here np^ — {y’ —p) 2y-p = fi(y' — p)^ > 0 

unless y'—p. The identity 

J((7)- J(A’) = | 6dx 


reduces to 




which is (7.4.8). 

This argument shows the genesis of (7.4.8), but does not prove 
it, for two reasons. In the first place, F has a singularity, and 
the theory of the field breaks down, for x=0. Secondly, the 
theory presupposes the continuity of y\ 

In order to dispose of the first difficulty, we may take and 
Pj to be (8, 8^+®) and (1, 1). The theory then gives the identity 


and we obtain (7.4.8), for continuous y', by making 8 tend to 
zero. 

When (7.4.8) is proved for continuous y', it may be extended 
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to general y' of U by standard prooesses of approximatioa. We 
deal with this point in the next section, in a different problem 
where we have no alternative elementary proof. 

Similar considerations lead to the identity 

(7.0.2) 

Here 

y(0)=0, y(l)=i, y'so, A>1, =-^. 

and A is the (unique) root of 

(7.6.3) /X (Jfc -- 1 ) - 1 ) -h 1 = 0 

which lies between 1/i' and 1. When the form of (7.6.2) has been 
determined we may put 


where A=! 1/fc' and /xA*= 1. We thus obtain 


jfc-1 


+ir 


It may be verified directly, by partial integration, that this is 
true whenever y* is and we can prove as in §7.4 that the 
identity remainB true when the upper limit 1 is replaced by oo and 
the term Jk/(Jfc — 1) is omitted. Since, by Theorem 41, 


for all positive a, 6, we thus obtain a proof of a theorem (Theorem 
327) which will be stated exlplicitly, and proved in an entirely 
different manner, in § 9.8. 

Incidentally we obtain 

( k 


• Th© theory of the field gives the form of the identity, which ^y then be 
verified independently. The limitation to curves for which would mtroduoe 
another slight oompUoation into a properly variational proof. 
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y(0)=s0, y(l)e=l, onrf|/'i8 

'Jo 

where \ is the root of (7.6.3) between l/k' and 1. 


7.6. Further examples illustrative of variational 
methods. It is difficult to distinguish at all precisely betwron 
‘elementary’ and ‘variational’ prooffi, since there are many 
proofs of intermediate types. We give a selection of such proolis, 
worked out with varying degrees of detail, in this and the succeed- 
ing sections. 


(I). 256. If 2 / (0) = 0 and 2k is an even positive integer, then 

(7.6.1) I* y^dx^cf y'^dx, 

Jo Jo 


where 

(7.6.2) 


1 


2k-l 



2k} 


There is equality only for a certain hyperelliptic curve. 

(i) We suppose first that y(l) +0, in which case we may take 
y(l)= 1, and consider 

J(y)=^\Gy'^-y^)dx. 

Euler’s equation is 

{2k- 1) +3/«-i= 0, 

which gives {2k — 1) Gy'^ =G' — y^, 

where C" is a constant of integration. 

There is one extremal which passes through (0, 0) and (1, 1) 
and outs 1 at right an^es. In fact, if we take (7'«- 1, then y' 
vanisheB when y=‘ 1. Also 

and, sinoe 
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there is an extremal of this type which passes through (0, 0) and 
(1, 1). If we denote this extremal by Y, then 

J(7) = T {{2k- 1) (7F'2*-2 Y" + Y^-^}dx=0, 

since F'(1)=:0. 

To prove the theorem, we must show that y=Y gives a strong 
minimum; and this follows easily from the general theory. The 
extremal is a rising curve of the same general form as the curve 
y^sin^Traj, to which it reduces when 1. The curve y = ay is 
also an extremal ; the family y = a F defines a field in the sense of 
§7,6; and the excess-function 

S = (y' — 

is positive. Hence the standard conditions for a minimum are 
satisfied. This proof is genuinely ‘variationar, and (in view of the 
trouble of calculating the slope-function p explicitly) it might be 
difficult to find a more elementary proof. 

There is however one point in the proof which demands an 
additional remark. The ‘general theory’ assumes that y' is con- 
tinuous, and it may not be obvious how its conclusions, in par- 
ticular in regard to the uniqueness of the solution, are extended 
to the more general y considered here. 

Let us denote by the class of integrals y of functions of 

by J* the class of integrals y * of continuous functions. The general 
theory shows that 

(7.6.3) J{y*)>J{T) 

for a y* different from F, while we require the same result for 
any y of We can approximate to a y of different from F , 

by a sequence of functions y*^, in such a manner that 

J(y) = lim J(y*); 

but all that then follows from (7.6.3) is 

J(y)^J(F), 

the strict sign of inequality being lost in the passage to the limit* 
The difficulty disappears if we look at the question differently. 
The general theory proves not only the inequality (7 .6,3) but also 
the identity _ /(F)=/g (y*)da;. 
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where €(y*) ie the exoeea-fanotion oorrespondtog to jf* and the 
field y= aF. Approximating to y by an appropriate sequenoe of 
y*, we replace this identity by 

J{y)-J{Y)^!€{y)dx, 
and the integral is positive unless y<^Y. 

(ii) The case in which y(l)=0 may be discussed similarly, 
since y=0 is then an extremal satisf 3 nng the conditions, and is 
included in the field y=itY used in (i). 

The proof might have been arranged differently if we had 
made no hypothesis about the value of y(l). The problem is 
then one with a ‘variable end-point’, that of minimising J(y) 
for curves drawn from the origin to meet the line z=l. The 
extremals cut this line ‘transversaUy ’ (in this case orthogonally), 
and all the curves y^etY satisfy this condition. The general 
theory shows that all the extremals give the same value of J (y), 
and this value must be 0, since it is 0 when a= 0.* 

7.7. Further examples: Wlrtinger’s inequality. (II) Let 
us consider more particularly the case 1; » 1 of (I). Changing the 
limits, the result is that 

r in ^hn 

(7.7.1) y»dx< y'*dx 

Jo Jo 

if y (0) = 0 and y is not a multiple of sin x. 

The general theory suggests that there is an identity of the type 

(l/'‘-y^)dx=j^ {y'-y^(x)}*dx 

or {y2(l + ^2)-2yy'^fr}da:=0. 

This will plainly be true if 

y* ( H- ^*) dx - 2y^dy 

is an exact differential dz, and z vanishes at the limits; and this 
requires -tan(x-i-fc), 

» We owe tbeee remarks to Prof. Bliss. 
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in which case *= — If we take cot*, then t 

Tanishes when »«= i«r; and since y’ in and so* y=o(**), * also 
vanidies when * s 0. We thus obtain 

(7.7.2) f* (y'*-y*)d*= f*^(y'-ycot*)*d», 

Jo Jo 

which makes (7.7.1) intuitive. 

A slight modification of (7.7.2) leads to 


267. If y (0) = y (w) = 0 ond y' is L\ then 

f y^dx< I y'^dx 
Jo Jo 

unless y-Cmix. 

For y=o{ 3 ^) for small *, and y=o{(ir— *)*} for * near w,* so 
that y*cot* vanishes at both limits. Hence 

(7.7.3) j*^(y'*-y®)<i»= (y'-ycota:)®da;. 

Another modification of (7.7.2) leads to a more interesting 
theorem due to Wirtinger**. 

258. If y has fhe period ^,y' is L^, and 


(7.7.4) 

then 


^ 2ir 

ydx=0, 

r 2ir f 2tr 

I y'^dx 


unless A cos a: + J5 sin x. 

We cannot write down at once an identity similar to (7.7,2) 
or (7.7.3), but with 0, 2 ‘ 7 r as limits, because yaotx will usually 
have infinities in the range of integration. We may however argue 
as follows®. 


b See Blaschke (1, 106). The most immediate proof ie by on appUcotion of 
Par0eval*8 Theorem to the Fourier developments 

y~K+SKooBn*+6„8iii«*). y'~2:(n6„ooBn»-«o„«in»*) 

« The proof which follows was communicated to us by Dr Hans Lewy. 
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The function z(a:)=y(»+ir)— j/(a:) 

haa opposite signs for the arguments x and x+tt, and therefore 
vanishes at least once in (0, v). We suppose that z(a) = 0, where 
0ga<7r, and write y(a) = o. Since y' is L®, (y— o)®cot(a:|— «) 
vanishes for x=x and x = a + v ; ‘ and 


[y'* - (2/ - «)* - {y' - (y - o) cot (a: - a)}*] dx 

= — o)*cot(ar— a)J =0.\ 

Hence, using (7.7.4), we obtain 

/*2ir /*2v 

I (y'*— y®)da:=2iTO®+ I {y'—(p—a)cot(x—a)}^dx, 

Jo Jo 


which is positive unless a = 0 and 

y'sycot(a:— a), y= (7 sin («—«). 

There is a special interest in Theorem 258 because the proof of 
the classical isoperimetric property of the circle may be based 
upon it. We consider a simple closed curve C whose area is A and 
whose perimeter is L, and take 



where s is the arc of the curve, as parameter, so that 


x=x(<f>), y-y{4) (0g^g2w). 


We suppose for simplicity that x' and y' are continuous; the 
proof is valid for more general x, y. We may also suppose without 
loss of generality that the centre of gravity of the perimeter lies 
on the axis of x, so that 


We have then 



0 . 


,d<f>j '^\d<f>j J4v»'‘4v*’ 


* Utmg Theorem 222 m in § 7.4 and above* 



187 


THE OAIiOTTLUS OF VABIATIONS 

by Theorem 258. There will be inequality unless 
y = Acos(f> + 

and also *= -Jyd^= -^sin^+£co8^+(7, 
when the curve is a circle**. 

7.8. An example involving second derivatives. (Ill) It 
is known® that, if/ has a second derivative for ® ^ 0, and /tg , 
are the upper bounds of |/|, |/' |, |/" |, then 

This suggests that there maybe a corresponding relation between 
the integrals 

j,=r\f"\^dx. 

Jo Jo Jo 

where p ^ 1. The next theorem settles this question in the case 

p = 2. 

259. If y and y" are in (0, cjo)**, then 

M y'^dx\ <4j y^dxj y''^dx, 

uideaa y—AT{Bx), where 

F=e~*®sin(a:siny— y) (y=iw), 
when there ia equality. 

• Or -i[^ v — dA, acoording to the sense of the Taristion of s as we pass in 

Jo ^ 

a giyen sense round the curve. 

The proof is in principle that of Hurwitz (2), but differs (o) in that we do not 
use the theory of Fourier series and (6) in our unsymmetrical treatment of x and y. 
<» Landau (2, 3). ^ ^ j, ^ s j 

« In accordance with the convention of §7.1, y' is the integral of sr andyofjr. 
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If we consider tbe ‘isoperimeixioal* problem of tiie Oalmdos of 
VariationB defined by 

Vi= J y'^da: maximum, *^0= J ^ •^«=' J ^ }f"*dx <given\ 

we have to form Etiler’s equation for 

(y'*-Xy*-fiy"^)dx. 

It is a linear equation of the form 

ay""+by"+cy=0, 

whose solutions are linear combinations of real or complex ex- 
ponentials. When we try to choose the parameters in the most 
advantageous way, we are led to consider the function Y. 

It seems difficult to complete the proof on these lines by use 
of the general theory. We shall deduce Theorem 259 firom the 
simpler theorem which follows. 

260. Under the conditions of Theorem 259, 

J(y)=j^ {y^-y'*+y"')dx>0 

uideae y—AY, when there is equality. 

We give several proofis of this theorem to illustrate differences 
of method. Thefirst two are, as they stand, elementary ; the third, 
of which we give only an outline, makes explicit the variational 
theory which lies behind the other two. We begin by an obser- 
vation which is necessary in any case, viz. that is finite. 

To prove this, we have 

(7.8.1) -J^ yy"dx. 

Since Jq and J, are finite, the last integral tends to a finite limit* 
when X-*-eo. If were infinite, yy' and a fortiori 

y*-2iyy'dx 

would tend to infinity, which is impossible on account of the con- 
vergence of Jo • Hence is finite, and all three terms in (7.8.1) 

* By Theorem 181# 
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tend to limits. In particular yy" tends to a limit, which can only 
be 0 (again on account of the convergence of J,). 

(1) Our first proof proceeds rather on the lines of §7.4. It is 
easily verified that 

y+y'+r"=o, r(o)+r'(o)=o, r"(o)=o 

and J(Y)=sOfi 

We now write y=z + cY, 

choosing c so that a (0) = 0. Then a, s', and a" are L*, a and a' are 
o (**) for large «,'> and as' -> 0. 

Now J(y)=^J(z) + 2cK(Y,z) + cU(Y), 

where 

K = p{Yz- Y'z’ + r"s") 6m 

= - f“(y'+r")sa!x- {" {Y+Y')z"dM- Y'z'dx 
JO JO Jo 


=j; 


(7+r'+r")2'(£a; = 0. 


Hence J{y)^J ( 2 ), and it is enough to prove that J{z)>0 unless 
2 =s 0. But, since 2 (0) = 0 and zz* -> 0 when a? 00 , 


rz'^dx=- - f" zz^'dx, 
Jo Jo 


and so 


/•oo 


{z^^zz"-i^z"^)dx>0 


unless 2 J = 0. This proves Theorem 260, 

(2) We may try (following the lines of § 7,6 and § 7.7) to reduce 
Theorem 260 to dependence upon an identity. For this, we make 
(7.8.2) {y* - y'* + y"* - (y" + 4>y + #')*} dx 

an exact difieiential; and the simplest choice of ^ and iji is 
1, when (7.8.2) reduces to 

-d{y+y')^. 

Thus 

J^{y*-y'®+y"®-(y+y'+y")*}‘^=‘ * 

* This requires a little calculation. ** By Theorem 223. 
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Since *7o> •^s ^ finite, the left-hand side has a limit 

when Z -► oo.» Also yy" -*■ 0, and so y* + y'* tends to a limit, which 
can only be 0. It follows that 
(7.8.3) 

(y^-y'^+y”‘)dx={y(0)+y'{0))^+ j ^ (y+y' + y'y' 
which is positive unless 

y"+y'+y=o 
and y'(0)-l-y(0) = 0. 

These conditions give y= AY. 


(3) The underlying variational theory is a little more complex 
than that of § 7.6. If we put 
(7.8.4) = 

(7.8.6) J{y) = J(y,z) = j ^(y^-y'^+z'^)dx, 

and suppose that 

(7.8.6) y(0) = l, 2(0) = ^, y(QO) = 0, z{co) = 0, 

then the problem is a ‘Lagrange problem’ , viz. that of minimising 
J {y, z) when subject to (7.8.4) and (7.8.6). 

The extremals of the field, in space {x, y, z) are given** by 


(7.8.7) 


dy da: \dy') ’ dz da: \dz’) ’ 


where 0= Z— A(y'— z)=y*— y'*+z'*-A(y'— z), 

and A is a function of x defined by the equations themselves. In 

this case the equations (7.8.7) reduce to 


d 


(7.8.8) 2y + ^(2y'-|-A) = 0, 




and from these and (7.8.4) we find 
(7.8.9) y''"+y''+y=0. 


* The integrals 


j ^ yy'dx, j ^ nY'dz, j ^ yy"dz 


being oonvergent, by Theorem 181. 

^ For an aoooxmt of the general theory see Bliss (2). 



THH OALOTTLITS OF VARIATIONS 


191 


The most general solution of (7.8.9), for which y and z vanish at 
infinity, is 

(7.8.10) y=ae-i^+ae-P‘, 

where p == and the bar denotes the conjugate. 

The equations (7.8,10) and 

(7.8. 1 1 ) 2 = — ape-P^ — dpe-P^, 

(7.8.12) A=2ae-/»+2de-P», 

define a two-parameter ‘field’ of extremals through (oo, 0, 0). 
The ‘slope functions’ p, q, and the ‘multiplier’ A of the field 
are the functions 

p (x, y, 2 ), q (x, y, z), A («, y, z), 

obtained by expressing the slopes of the extremal, and the func- 
tion A defined by (7.8.8), in terms of x, y, z. A streiightforward 
calculation gives 

(7.8.13) p=x, q= -y-z, 

(7.8.14) A=2y.» 

The analogue of the Hilbert integral is 

= J [{<!>- p^j,-q9^)dx+%dy+%dzl 

This integral has properties corresponding to those of §7.5; it is 
independent of the path between its end-points, and its value 
along an extremal is the same as that of 

J=:^Fdx, 

Also, if is the extremal, and C any other curve, joining the 
end-points, we have 

(7.8.15) 

where S is the excess-function, defined here by 
€ (®, y, 2 , y', z\ A) -O (*, y, z,p, q, A) - (y' -p)% -(«'-S)®«- 

• p. j, A are in the first instance functions of * and the parameters o, a of the 
extremal. Here, in particular, 

y'= -ope-<>*-aj 5 e-f»=*, s'=o/>*e-l>*+3p*e-»»= -y-t. 

A=2*"= -2op*c-(>*-2dA*e-v*=2y- 
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In this case it will be found that 
(7.8,16) s^{y^z + {y* -- z)^ 

and •^^-(1 + 0®* 

Since 6 reduces to (y+y' + 2)^ when y'—z, we obtain 



+ 2'2)da:=:(l + 0" + 





which is (7,8.3). We have thus proved Theorem 260 iii three 
difEerent ways. \ 

We have supposed that y (0)4=0, so that we can take y(0)=k The 
case in which y(0) = 0 may be discussed similarly*. 

Our object has been only to illustrate a method and not to discuss a 
difficult general theory, and we have therefore presented the argument 
very shortly. The following remarks'* may help to make the method 
intelligible. 

(a) The integral J* constructed from a two -parameter set of extremals 
is not necessarily invariant. Here we can verify the inv€triance bf J* 
directly; in fact 


This invariance ensures that our extremals form a ‘ field*. The ‘reewon * is 
to be found in the fact that they pass through a fixed point, 2 , and A 
all vanishing for a; = oo. 

(h) In this case <]> is quadratic, and 

^ = i {(y’ -?>)• + 2 (3/' -P) (*' -q)<t>„-¥ (*' - q)* 

which leads immediately to the formula (7.8.16). 

(c) Suppose that E and C are defined as above, thari^o positive 
axis of Xf and L an arbitrary curve joining (0, 1, () to the origin. Then 

=(l + 0*+J^ (y+y'+y")**!- 

This is (7.8.3); the argument avoids a direct calculation of Alterna* 
tively we may argue 

J 0 


» Compare $ 7.6. 

For which we are indebted to Prof. Blias and Mr L. C. Young. 
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In order to deduce Theorem 259 from Theorem 260, we apply 
the last theorem to y {x/p) instead of to y (*). We thus obtain 

{p^y^—p^y'^+y"^)dx>o. 

Since this is true for all positive p, and in particular when 
/j* = Ji/2 Jo , it follows that Jj* < 4 Jj unless y {xjB) = AY{x). 


7e9e A simpler problem* It is interesting to observe that the corre- 
sponding theorem for the interval ( — oo, oo) is much more elementary and 
quite different. 


261 • If y and y" are in ( — oo, oo), then 

y'^dx^ < j y^dx j y"^dx 

unless 2 / = 0. The {unit) constant is the best possible^ 

In fact (as in the proof of Theorem 260) yy' 0 and 

Ji>= j.j. 


To prove the constant best possible, we take y = sin a? for \ x\ ^nn, i/=s0 
for I « I > nw, and round off the angles at a? = + nTr so as to make y" con- 
tinuous. We can plainly do this with changes in the three integrals which 
are bounded when n -► oo, and then each of them differs boundedly from nir. 


so that 

if n is sui&ciently large. 


Jia>(l-.€) JoJi 


MISCELLANEOUS THEOREMS AND EXAMPLES 
262* If y (0) = y (1) = 0 and y' is L*, then 

unless y = ca;(l — a;). 

[If 

then(J27) is aj(l -a;)y"+2y = 0, 

and y = Ota; ( 1 - a;) is an extremal satisfying the conditions whatever be a. 
By varying a we can define a field* round any particular extremal. It will 
be found that in this case *7 ( F) = 0 and 


p = 


l-2a; 

a?(l-a;)‘ 




l-2a; 
2a; (1 -a;) 


u c , f , l~2a; 

y, x(i-x)^{ ’ 


* The field differs slightly in character from those described in § 7.6, since each 
extremal passes throng (0, 0) and (1, 0). 
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The underlying identity is 

The theorem should be compared with Theorem 225. For fuller details, 
and proofs of both theorems by means of Legendre functions, see Hardy 
and Littlewood (10).] 

263. then 

4JK>{y(0))* 

unless ys=ae~^». 

264. If 

(a) 2/(-l)=-l. y(I)=l, 3('(-l) = y'(l) = 0 

and 1; is a positive integer, then 




with inequality unless 

[This is an example of the theory of § 7.8, simpler than that in the text. 


In this case 


S = 2'**= - - 2k (z' - q) 


265. If y satisfies (a), and has a second derivative y'' for every x of 
( — 1, 1 ), then I y''{x) | > 2 for some x, 

[This theorem, which is easily proved directly, corresponds to the 
limiting case A; = oo of Theorem 264. The extremal curve of Theorem 264 
reduces to 

yz=zx~-“X^ sgn x. 

For this curve 2( 1 — [ a; | ), and y"= — 2 sgna? except for a; = 0. There 
is no second derivative at the origin.] 

266* If ^ is z'=y, and 

y{0) = y ( 27 r) =s(0) = 2(27r) = 0, 

then 

riy'»-y*)dx= 

Jo ^ Jo I 2 — 2oosa?— a?sma? J 

[This identity, which gives another (though less simple) proof of 
Theorem 258, is the result of treating Wirtinger’s inequality as a case of 
Lagrange*s problem, on the lines of § 7.8.] 

267. J"(y»+2y'*+y"»)d*>}{y(0)}» 

unless y = Ce~®(a? + 2). 

268* If 1 and y and y" are JD*' in ( — 00 , cx>) or (0, 00 ), then 
(/ 1 y' \*dx)»^K(k)S \y\^dxS\y" l*d», 
the integrals being ti^en over the interval in question.* 

* Hardy, Littlewood and Landau (1). 
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269. If Jfc> 1 and y and y" are L* and i»' respectively, then 

y I*'**) (|_^i 3/" 1*' <^y t 


unless y is null. 

270. Ify'isJD*,then 




y=airb' 

where a and 6 are positive, in whioh case there is equality. More generally, if 
(iii) Z>A;>1, r = |— 1, 


and y' is positive and L^, then 


(iv) 

where 

(V) 

unless 




rT{llr) 


[It is easy to prove an inequality of the type of (i) but with a less 
favourable constant. Thus, if we denote the integrals in (i) by il and J 
respectively, we have / r* \2 

y^ = H^y'dtj ^Jx 

and so K= f ^dx<4J^, 

J Q X J 0 ^ 

by Theorem 253. We know no elementary proof of the full result. For 
details of the variational proof, which is much more difficult than that 
of Theorem 260, we must refer to Bliss (3).] 

271. If a> 1, and (5(/) is the geometric mean off over (0, x), then 

unless /= G exp { — 

[See Hardy and Littlewood (7). The limiting case a = 1 corresponds to 
Theorem 335.] 

272. In the problem 


* /•* 
j y 


*da? maximumj I x^y^dx and 


f'^dx given'. 


the Euler equation is of the form 

y"4-(o + 6aJ*)2/==0, ^ 

and is soluble by parabolic cylinder functions. It has a solution y — 
if 6=:-a»=~{2o)*, 

[This gives the variational basis for Weyl’sinequality(seoTheor©m226).] 

X3*a 



CHAPTER VIII 


SOME THEOREMS CONCERNING BILINEAR 
AND MULTILINEAR FORMS 

8.1. Introduction. In this chapter we prove a numW of 
general theorems concerning the maxima of bilinear and multi- 
linear forms. In the early part of the chapter we consider forms 
in n sets of variables, but suppose the variables and coefficients 
positive. Later, we abandon this restriction, but suppose that 
»= 2 ; and most of the latter part of the chapter is occupied by 
the proof of an important theorem of M. Riesz concerning 
bilinear forms with complex variables and coefficients. 

POSITIVE MULTILINEAR POEMS 

8.2. An inequality for multilinear forms with positive 
variables and coefficients. We suppose that 

Vjy •••> 

are n sets of variables, i,j, running from — oo to oo; and that 

£, 2, S' 

% i 

indicate respectively summation with respect to all suffixes, 
summation with respect to i only, and summation with respect 
to j, ..., ik (all suffixes except «). The sum 

^=Say„.fca:^y^...2fc 

is called a multilinear form in the variables x, y, ..., z. When n is 

1, 2, or 3, the form is said to be linear, bilinear, or trilinear. 

If the series are absolutely convergent, then 

8 - laii . . . 2 * = . 2 * = • • • • 

i I i i 

273. Suppose that 

(8.2.1) 0<agl, 0<i3gl, ..., 0<y<l 
and 

(8.2.2) v"=- gMin(a,i3,...,y); 
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that a, jS , y are defined by 

(8.2.3) a-a=/3-^=...=y-y = ?^±J^ltyj:i 

n— 1 

(so that 0 g a g a, 0 gy gy), and that p, a, r are positive and 

(8.2.4) ap + ^ff+...+yT=l. 

Suppose further that 

Xi^O, J/y>0, 2*^0, 

(8.2.6) Sa:<»/“gZ, 'ZyfiP<Y, , Y.z^Vv^z, 

and 


(8.2,6) S'ajy jj. - .4^ g S'aT^ g JB, 

A; 

^Acw /8'=Safy...*a:<y^...2fc<45 5^.._ CvX'^YP ... Zv. 

We have in fact 


1-3 - #+^+...+y— 1 o fi 

1 — a — p— ... — y= ■ = a — a = jS — /3= ..., 


and so, by Theorem 11,* 

iS" = S {a/‘x^’’‘)^ (o'y^^)^ . . . (a’’’ z^Y)y — y 

< (So#»»v«)a (Zaoy^P)^.. (Za^z^y)^ (ZxV«y^P ... z^yf-'^-P- 
= (Sa:V“SV)S ... {Zz^yZ'a-'fiZx^^ ... Zz^y)^-^-"^ 

i i k k i k 

g (4Z)« ... (C7Z)y (Z7 ... Z)i-5— -r 
=^55^...c9Z“r^...zy. 




We note some special cases. 

(1) If a+ /S+ ... +y= 1, 

then a=a, j8=j8, ..., y=y, 

and the statement of the theorem becomes simpler. 


(2) When n=2, the second of the conditions (8.2.2) is satis- 
fied automatically. Ifwevrrite'’ 



then 





1 



» Extended to inBnite series: we shall not usually repeat this remark, 

^ In the preceding chapters the letters p and q have been reserved for the weights 
of mean values. In thin chapter they are not required for this purpose, and we use 
them as indices. 
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Exchanging p and a, and A imd B, we obtain 
274. Jf 

p^l, q^l, P>0. a>0. ^ + = 

Xai/>iA, Sic^SX, 

i 1 i i 

then 8==ZatjXiy ^ S A^J>‘B^^XVp 

(3) An interesting and still more special case is that in Which 
p=a=l, q=p\ 

276. If p>l, q>l, ^+^=1. 

Zaij^A, ZUij^B, Zx^^X, Zy^^^Y, 
i i i i 

then 8=ZZatfXty^^(BX)^p(AY)^. 

For the case p= 5 = 2 , seeFrobenius (1) and Schur (1). 


8.3. A theorem of W. H. Young. Another specialisation of 
Theorem 274 leads to an inequality of W. H. Young which is 
very important in the theory of Fourier series. 

Suppose that a=p> 1, so that 


2 I 
"1* 

V 9 



1 , 


1 1 

— 1 — > 
P 9 


1 ; 


and take Then 

ZUff = Zai/> = ZaJ> = A, 
i i 

say, for every j and i respectively. Hence, if we write 

(8.3.1) *n= S XfPf, 
we have, by Theorem 274, 

(8.3.2) So„2„ = YMi+^Xtyf g A^pX^ 

Since (8.3.2) is true for aU a„ for which ZaJ>^A, it follows, by 
Theorem 1 5, that p' g Xp'i^ Yp^'^. 

This must be replaced, when p = 1, by 2 „ g XV^ Y^. 

We have thus proved (apart from the specification of the oases 
of equality) the following theorem of Young*. 


* Young (3, 4, 6). Young does not consider the question of eqnnlity. 
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276. If p>l, q>l, -+->1, 

Pi 

and is defined by (8.3.1), then 

PQ g P 

(8.3.3) ^ (Zyfi) P’^Q-Pd . 

Equality can occur only if all the x, or all the y, or all the x but one 
and all the y but one^ are zero. 

We add a more direct proof which enables us to settle the 
question of equality. If we write l/p=l~A, llq—l — fiy then 
A > 0, > 0, A+/X < 1, and we can enunciate the theorem in the 

following form. 

277. If A>0, jLt>0, A + ^<1, 

z is defined by (8.3.1), and ©^(a:) as in § 2.10, then 

(8.3.4) (2) ^ ®wi-A) {*) (2/). 

with equality only in the cases specified in Theorem 276. 

Let V = 1 — A — /i. It follows from Theorem 1 1 that 

1 X _v_ A 1 _L. 


Applying this inequality, with 
to (8.3.1), we obtain 

1 1 ft 1 A 1 ^ 

(8.3.6) »„‘'^(Sa:/+’')"(Sy/+-)'’' S xf+’’yf+% 

i i i-hj=>n 

(8.3.6) Ci©1^(®)@l^(2/). ? 


Hence 


/A + l' 


A+i. 


o'V (A+i') 

X+y 

Sinoe the double sum here is equal to 


1 1 
fi+y A+v 


M ^ — n 

W IA-4'*' A+i' 


we obtain 


smd this is (8.3.4). 


SSa:r''y/*‘'=®'‘r(*)®i:(2') 

it JTfi >.+y 

1 1 1 

n fi+y x+y 
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We can dispose of the question of equality as follows. 

If not all X, and not all ff, are zero, there are n such that 
> 0 for some t, j for which i +j = n. We call the lattice point 
corresponding to such a pair i, j a point . Then (8.3.6) is true, 
for such an n, if, in the first two sums on the right, we limit 
ourselves to values of i aadj corresponding to points . It there 
is equality in (8.3.6), then, for such i, j the ratios 

1 1 1 1 
*/+’' y/+‘' : ar<^+’'y/+’’ 

do not depend on i and j, and the corresponding x^, and the 
corresponding , are all equal. It follows that there can, for any 
», be only a finite number of points P„ . 

Suppose that all these conditions are satisfied for a certain n. 
Then equality will still be excluded in the next inequality (8.3.6), 
imless the i and j corresponding to the exhaust aU i and j 
for which a:* > 0 and > 0. It follows that the total number of 
positive Xi and yj is finite. There is therefore a single poiut for 
which > 0 and n = i +^' is a Tninimum . For this n there is a 

unique P„ and, if there is equality iu (8.3.6) for this n, then x^=0 
and y^—O except for the corresponding pair i, j. 

8.4. Generalisations and analogues. Theorems 276 and 
277 have many interesting specialisations, generalisations, and 
analogues. We state a number of these without proof. 

278. If\>0,n>0,...,v>0,X+ft+... + v<l, and 

«>„= S Xfyf...z^, 

(hen 

© i__ {w) i (a;)©j_(y) ...©j_ (z), 

1— A— — — V 1— A 1— ft l—i' 

unieaa aU numbers of one set, or all but one of every set, are zero. 

279. If 

2k 

(hen Sc„*g(Sa«^^)“"^ 

unless aU a but one are zero. 
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Theorems 277-279, being ‘homogeneous in S* in the sense 
of § 1.4, have integral analogues. 

280* If A^O, 1, 

a 00 \llr 

-/■**) 

and h{x)- f f{t)g{x-t)dt, 

J —00 

then 3 1 W<3j (/)3 ijffh 

i-A-M 1-A 1-^ 

unless f or g is null, 

281* If Of ^>0, A + ft<li 

^Af) = f/dx, 

and h(x) = j f{t)g{x—t)di, 

then 1 (A) < 3 (/) 3 j_ {?). 

1 — A— M 1— A l — fi 

unless f or g is nvM. 

282. If h is an integer and 

^(a:) = J ...J f(xj)f{x2)...f{x^i) 

X f(x—Xi — ... —Xii^i)dxidXf^tt-dxii^i, 

r<o / r«> 

<A«» j f^'‘~^{x)dxj 

288. If 

^ (») = J /{*l) • ••/{**-!)/(* - *1 - — - ®fc-l) <^1 • • • » 

the iniegration being defined by 

hXfix, 

r <f>^{x)dx<^j^f'‘-Hx)dxj . 


then 
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284. Jf f{x) has tite period 2n, and 

^ “ ( 2 ^;^ J _/ ■ 

X /(a; — a?! — . . . dx^dz ^ . . . dxj ^^^ , 

1 fjr / 1 r«’ \2A:-1 

^ J (2^ J * 

8.5. Applications to Fourier series. Theorems 279 and 284yiave 

important applications in the theory of Fourier series. Here we are'^pon- 
oemed with functions and coefficients which are not positive, but the 
theorems which we have proved are sufficient for the applications. 
Suppose first that /(a;) and g{x) are complex functions of L*, and that 

-.00 - 00 

are their complex Fourier series. Then it is well known that 

( 8 . 6 . 1 ) Xa„b„ = ~j f(x)'g^)dx 

(the bar denoting the conjugate). In particular, if /(a?) is L\ 

( 8 . 6 . 2 ) 

Conversely, if L | a„ |* is convergent, there is onfix) of L* which has the 
a« as its Fourier constants and satisfies (8.6.2). 

These theorems ( ‘Parseval’s Theorem* and ‘ the Riesz-Fischer Theorem *) 
were generalised by Yoimg and Hausdorff. We write 

(8.6.3) e,(o) = (S|oJ«-)‘/«’, 

so that 6.(a)i3 @,(| a |) as defined in §2.10, and (8.6.2) may be written 

(8.6.4) ®,(o) = 3.(/)‘ 

Young and Hausdorff proved that, if 

( 8 . 6 . 6 ) 

then 

( 8 . 6 . 6 ) 3 ,'(/)^®,(®) 

and 

(8.6.7) S,'(o)g3,(/)- 

The limitation on p is essential. The theorems were proved first by Young 
(3, 4, 6) for a special sequence of values of p and p', viz. 

OJL 

(8.6.8) (fc=l,2,3,...), 
and then generally by Hausdorff (2). 
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We confine ourselves here to the case (8.6.8), considered by Young. 
In this case (8.6.6) and (8.6.7) are corollaries of Theorems 279 and 284 
respectively. For example, the c„ ofTheorem 279» is the Fourier constant 
of ^=/*, and so 

= S|c„l>gS^J^«2ft_l)(a), 

which is (8.5.6). Similarly is the Fourier constant of the ^(a;) of 
Theorem 284, and so (8.6.7) may be deduced from Theorem 284. 

The proof of (8.5.6) and (8.6.7) for general p is decidedly more difl&oult: 
see §8.17. 

It is interesting to observe (as another application of Holder’s in- 
equality) how (8.6.7) may be deduced from (8.5.6). Write 

= 1 On I SgnOn = I On I 

if a,|4:0 and | n | and — O otherwise; and let 

gf(a?) = S6ne"<». 

Then £ |o„l«''= Sa„6„=^f f{x)g(x)dx. 

-N -N -IT 

since ^ is a trigonometrical pol 3 momial. Hence, using Holder’s inequality 
and (8.6.6), we obtain 

£ |a„l»'g3,(/)3,-(!7)S3,{/) <5,(h) 

^ / N \l/p / ^ 

=3.(/)(_s =3,(/)(_s loj'') . 

Transposing the last factor, and then making N tend to infinity, wo 
obtain (8.6.7). 

8.6. The convexity theorem for positive multilinear 
forms . In this section we prove a simple but important property 
of multilinear forms with positive variables and coef&oients. 
The theorem which we prove is a mere corollary of Holder’s in- 
eq^uality, but it is useful, and will serve as an introduction to the 
deeper theorem of §8.13. 

286.0 Suppose a S 0, a: ^ 0, . . . , z ^ 0, ond 

is the upper bound of 

for aMx,y,..., zfor which 

Sa:V“gl, Sz^rgl; 

• Now, of oonrae, formed from complex a, 

* M. Eiesz (1): Riesz has »=2. 
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Otm ia a convex function of a, ]8, yin (he region 

a>0, j3>0 y>0. 

By a convex fraction of nvariables a, j8 we mean (§3.12) 

a function convex along any straight line in the space of a, )3, . . . , y . 
We have to verify that if 

^0, ^2 = "t" ^2 ~ 

a=<iai + i2K2> V~^'i.yx'^^2yi^ 

then 


( 8 . 6 . 1 ) 


■^a./5,...,y = yi ft yi* 


Now 

8 = Soiey . . . 2 = S (aa:“i'“y^i/^ . . . zVi/y )<i {ox'^'^yM . . . iSfJY)<t 

^ (Saa:“»/“y^i'^ . . . 2)'i/y)'» (Soa:“.'“y^*'^ , . . zVt'y)'. . 
Since S (x“i'“)^“> = g 1 , . . . , 


the first sum on the right does not exceed and simi- 
larly the second does not exceed y^. This proves (8.6.1). 

The theorem may be extended to the closed region «^0, 
/3^0, ..., if we replace the conditions 1, 1, ... by 

1» y ^ 1. ••• when a, )3, ... are zero. 

Suppose for example that n—2 and 

i i 

Then and so that for 

0<a< 1. If p> 1, g=p', we may take a=l/p, 1 — a= l/j; and 
then we obtain 

which is equivalent to the result of Theorem 275. 


OBITBBALITIBS COKCEBKINO BILINBAB BOBMS 

8.7. General bilinear forms. So far we have been occupied 
with ‘positive’ multilinear forms, i.e. forms whose variables and 
coefficients are non-negative. The most important multilinear 
forms are bilinear, and the remainder of this chapter, and most of 
the next, is concerned, from one point of view or another, with 
bilinear forms, which will not generally be positive. 
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We shall denote the form whose ooeffioients are by A, 
similaxly with other letters. We change the convention of § 8.1 
concerning the range of the suffixes; untiltheendof §8.12,i, j, ... 
run from 1 to oo. We write 

We also write 

(8.7.1) 

i 3 

When the form is positive 

(8.7.2) 4 = S = Sa:,. 7^ = Xy , 

i 3 i 3 

the convergence of any one of these series involving the con- 
vergence of the others and the equality of the three. The equations 

(8.7.2) are true also for complex a, x, y when the form is finite. 
We shall make repeated use of the following general theorems*. 


286. Suppose that 

, , 11, 11, 

p>l, q>l, -+~,= 1, - + v = ^ 
p p q q 

(so that p' >1, g'> 1) and that a, x, y are real and non-negative. 
Then the three assertions 

(8.7.3) \A(x,y)\iKiBp(x)®,(y) 
for all X, y ;^ 

(8.7.4) S^iX)gmj,(x) 
for all x\ 

(8.7.6) @p(y)i^©fl(y) 

for all y, cure equivalent. 


287. The three assertions 

(i) tAere is inequodity in (8.7.3), unless either (x^ or (yf) is null; 

(ii) tiiere is inequality in (8.7.4), unless (x^) is null', 

(iii) Ocere is inequality in (8.7.5), unless (y^) is null; 
are also equivalent. 

• For the case see HelUnger and Toeplitz (t); for ?=p', F. Mm* (!)• 

The Bubstanoe of the general theorems is to be found in M. Riesz ( 1 ). ihe im- 
portant oases are naturally those in which K has its least possible value, i.e. is e 
boundof A (§8.8). 

^ Here A £0; but we write j A | for A in view of Theorem 288. 



206 


SOMB THBOBEMS OOKOBBNIKO 


288. When the forma are finite, Theorems 286 and 287 are true 
alao for forma toith complex variablea and eoeffieienta. 

Theorem 286 is a simple corollary of Theorems 13 and 15. 
It follows from (8.7.2) and Theorem 13 that 

(8.7.6) A = (y)®^ (X), 

BO that (8.7.4) is a sufficient condition for the truth of (|8.7.3); 
and Theorem 1 5 shows that it is also a necessary condition, ^ence 

(8.7.3) and (8.7.4) are eqmvalent, and similarly (8.7.3) and 
(8.7.6) are equivalent. 

There will be inequality in (8.7.3) unless (y^) is null or there 
in equality in (8.7.4). Hence the second assertion of Theorem 287 
implies the first. If the a:^, and so the X^, are given, we can, by 
Theorem 13, choose a non-null (y^) so that there shall be equality 
in (8.7.6). Hence, if there is equality in (8.7.4) for a non-null 
(Xt), then there is equality in (8.7.3) for a non-null (a:<) and (y^). 
Hence the two assertions are equivalent, and similarly the first 
and third are equivalent. This proves Theorem 287. 

Finally, the whole argument applies equally to complex a, x, y 
when the forms are finite*. We have only to use Theorem 14 
instead of Theorem 13. 

The most important case is that in which q=p', q'=P> when 

(8.7.3) , (8.7.4), and (8.7.6) become 
|XlgX@,(x)®p.(y), (Sp{X)<KQp{z), @,,(r)gX©j,-(y). 

BOUNDED BILINEAR FORMS 

8.8. Definition of a bounded bilinearform. Throughoutthe 
rest of this chapter we suppose, except when there is an explicit 
statement to the contrary, that the variables and coefficients in 
the forms considered are arbitrary real or complex numbers. 

We describe the aggregate of all sets (®) or Xj, ..., real or 
complex, for which 

(8.8.1) ®p(»)=@p(| * |)= (S I S 1 

as apace [p]. Here p is any positive number; but usually p > 1. 
• Otherwiac there are difficulties concerning the mode of summation of A.* 
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Similarly we describe the aggregate of sets (x, y) for which 

(8.8.2) ©p(®)gl. <B^{y)Sl 

as apace [p, g]. The most important case is that in whiohp = g=2. 
The importance of this spaoe was first recognised by Hilbert, and 
we may describe it shortly as Hilbert apace. 

In our general definition povq may be oo, if we interpret S* (*) 
as Max | * |. Thus spane [oo, oo] is the aggregate of sets (x, y) for 
which |a;| ^ 1, | y | ^ 1. 

A bilinear form 

(8.8.3) A=A(x,y)=i:SaijXiyf 
is said to be bounded in \p, 3] if 

(8.8.4) \A^{x,y)\= S tatjXiy^ ^M, 

where M is independent of the x and y, and of n, for all points of 
[P. ?]• We call An a aeclion (Abachnitt) of A : a form is bounded 
if its sections are bounded. 

It is plain that (8.8.4) will hold for all points of [p, g) if it 
holds whenever 

(8.8.6) ®p(x) = l, ©g(y) = l. 

In this case (8.8.4) may be written 

(8.8.6) Mn (*» 2^) I ^ 

and here both sides are homogeneous of degree 1 in a; and in y, so 
that the conditions (8.8.5) are immaterial. We might therefore 
have taken (8.8.6), with unrestricted x, y, in our definition of a 
bounded form. 

So far M has been any number for which (8.8.4) or (8.8.6) is 
true; if so, we say that A is bounded by M, or M is o bound of A. 
It is natural to take if to be the smallest such bound, and then 
we say that M is the bound of -4.® 

* If is the maximum of A^* binder the conditionfl 

then plainly ^ and Jf „ is bounded in n. Hence 

ifsa lim 

existsi and is the smallest bound of A» 
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An important special case is that in which p = 9 and 

when A is said to be symmetrical. In this case a necessary and 
sufficient condition that A should be bounded is that the qjtadratic 

A(x,x) = 'SIAijZiXf \ 

should be bounded. When we say that A {x, x) is bounded, we 
mean, naturally, that A {x, y) is bounded when the x an^ y are 
the same, i.e. that 

\A„(x,x)\^M 
for all X for which ©p (a;) = 1- 

In the first place, it is obvious that the condition is necessary, 
and that the bound of A (x, x) does not exceed that of A {x, y). 
That the condition is sufficient follows from the identity 

(8.8.7) (*, y) = lA^{x+y,x+y)-lAn{x-y,x-y). 

When p = 2 we can go a little further; the bounds of A (», x) and 
A (x, y) are the same. In fact, if if is the bound of A {x, x), then 
(8.8.7) gives 

1 A, (a;. y) I ^ JifS (I »+y |*+ 1 |*) = iifS (1 a; 1*+ 1 y I*) S if. 

It is evident that, when the coefficients a are positive, A is 
bounded if it is bounded for non-negative x and y, and that its 
bound may be defined with reference only to such x and y. If 

A*-'SS\ai)\Xfyf 

is bounded, A is sadd to be absolutely bounded. 


8.9. Some properties of bounded forms in [p, q\. The 
theory of bounded forms is very important, but we oaimot 
develop it systematically here. We prove enough to enable us to 
give an account of some special forms in which we shall be 
interested in the sequel. 

We takep> 1, y > 1, and, as usual. 


P' 


P 

“p-r 




g 

g-i* 
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289. If A has the bound M in [p, q\, then 
(8.9.1) S I \P' ^M^', SI ay I®" g 

i j 

for every j and i respectively. 


Take all the x zero except , tcj = 1, and all the y zero except 
y\>ys>—,yj- By (s.s.e), 


J 

'^i^y} 




(flS'd')"'’- 


Since this is true for all and all J, it follows, by Theorem 15, that 

S 1 ajj I®' g itf®". 

j 

This is the second of (8.9.1), and the first is proved similarly. 


Thus a necessary condition for boundedness in [2, 2] is that 

Slo<^l*<cx) 

* i 

for ally and t. The condition 

SS j a^j I * < 00 

is sufficient, since then 

<,i t,i iJ i i 

but this condition is by no means necessary, even when the coefficients 
are positive. Thus, as we shall see in § 8. 12, 


is bounded. 


*+.; 


290. Any row or column of a bounded form is absolutdy con- 
vergent. 

For 

S 1 \<\y^\(^\x, \p)yp (S | < if j 1 @p(x), 

by Theorem 289. 

It is plain that, when ^ 0, a necessary condition that A should be 
bounded is that 

(8.9.2) JlJkiifXiyi 

should be convergent for all positive x and y in [p, 5]. It is natural to ask 
whether this is also true for bounded forms with arbitrary real or com- 
plex coefficients, i.e. whether, when A is boimded, the series (8.9.2) is 
necessarily convergent (for the x and y of [p. g]) in any of the recogmsed 
senses. The answer is affirmative : if i is bounded, the series (8.9.2) is 
convergent (indeed uniformly) in the three standard senses, as a double 
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series in Pringsheim*s sense or as a repeated sum by rows or columns. 
But the idea of the convergence of the double series is irrelevant to our 
present purpose (and is not very important in the general theory), and 
we shall not prove these theorems. See HeUinger and Toeplitz (1) for 
the case [2, 2]. / 

8.10. The Faltung of two forms in [p, p']. We suppoi^ now 
that If ^ and B are bounded in [p, p'], with bounds M 

and N, then, by Theorem 289, 

k k 

and therefore, by Theorem 13, the series 

(8.10.1) fti=‘^0'ik^ki 

k 

is absolutely convergent. We call 
F= F (A, B) = 

the FaUung of A and B. The order of .<4 and B is relevant, F {A, B) 
and F{B, A) being usually different forms. 

291. If A and B have the bounds M and N in [p, p'], then F is 
bounded in [p, p'], and its bound does not exceed MN. 

Suppose that m ^ « and that ar< = 0 for i>n. Then, since A is 
bounded by Jf in [p, p'], we have 

Mmh, ^Vk 

|fc“l i—l 

for all X and y for which ©,(*) ^ 1 and 1- Hence, by 

Theorem 15, 

P 

S atifXi 

k~l i-l 

for m^n; and therefore 

(8.10.2) S a^xt SMP. 

Similarly 

(8.10.3) S 

But 

(8.10.4) 
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From (8.10.2), (8.10.3), (8.10.4), and Theorem 13, it follows that 


I ^MN. 

which proves the theorem. 

It is plain that we can define the Faltung of A and 5, whether 
A and B are bounded or not, whenever the series 

k k 

are convergent. 


8.11. Some special theorems on forms ih [2, 2].^ In this 
section we confine otirselves to the classical casep=^ = 2, and 
suppose the variables and coefficients real (though not generally 
positive). We suppose then that -4 is a real form, and denote by 

(8.11.1) A^ = 

the form obtained from A by exchanging suffixes in . 

If 


(8.11.2) Sa2^/k<oo 

k 

for all i, the series 

(8.11.3) 

k 

are absolutely convergent, and, by (8.10.1), 

(8.11.4) C'(a:,y) = SSc,,a;»y, 

is the Faltung F (A, A') of A and A'. In particular 0 («, *) is a 
quadratic form whose section is given, after (8.10.4), by 

(8.11.6) C^(a:,a!) = S^^S%fca:ij 

We write 

Gix,x)^N{A) 


and call C (x^x) the norm of A. When A satisfies (8.11.2), we 
say that the norm of A exists^ The existence of 37^(4) is, after 
Theorem 289, a necessary condition for the boundedness of 4. 

If 4 is bounded, with bound if, then, by Theorem 291, N (4) 


• Hellinger and Toeplitz (1), Sohur (1). 
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is bounded, and its bound P does not exceed M*. On the other 
hand, whenever N (A) exists, 

I (*,y) I = ^yi^aiiXi : ( j I 

Hence, if N {A) is bounded by P, A is bounded by P*. 

Collecting our results, we obtain 

292. A necessary and sufficient condition that a real form A should 
be bounded in [2, 2] is that the rumn N {A) of A should exist and be 
bounded. If M is the bound of A, and P that ofN{A), then 

P^M^. 

A useful corollary is 

293. If A, B, ...is a finite set of forms whose norms exist, and 

H{x,x) = N(A) + N(B)+... 

is bounded, with bound P, then A, B , ... are bounded, with bounds 
which do not exceed P*. 

In fact, if Nn{A), ... are sections of N {A), ..., then ... 

are non-negative*, by (8.11.6), and 

(x, x) = N^{A)+N^(B) + .... 

Hence N^^A),... are bounded by 

8.12. Application to Hilbert’s forms. We now apply 
Theorem 293 to two very important special forms first studied 
by Hilbert. 

294. The forms 

3 .^\ , 

t+3-V i-y 

where 2, and the dash implies the omission of the terms 

in which are bounded in real space [2, 2], with bounds not 
exceeding rr. 

• That ifl to say, aBSume non -negative values only for real x. The phrase * positive 
form * has been used in this chapter in a different sense, that of a form with non- 
negative ooeffioients and variables. 
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It is plain that each form satisfies condition (8. 1 1,2). We write 

and calculate 
If i—j, we have 
( 8 . 12 . 1 ) 


fe-i 


1 

(» + *-!)* 


(»-&)“ 


If t+j, then 


S' 

~oo 


1 

{»-*)*“ 


iv*. 


^ ft-i(t+*-l)(i+ifc-l)^fc=i (i-A:)(j — ft) 

^ 2' i — ._L_ S' / _i i \ 

the dash here excluding the values k=i and k=j. If £ is greater 
than both | i | and \j |, then 

_ 2 / 1 1 1 1 \ 

+ K i-K - i + K)’ 

with two series unbroken except for the omission of terms with 
denominator 0, and the bracket tends to zero when jP->oo.» 
Hence 

( 8 . 12 . 2 ) 

From (8.12.1) and (8.12.2) it follows that 
(8.12.3) N{A)+N{B) = ji:x^+2l.^'^^, 

The first form here has the bound and, since 


S' 


.-S' 






the second satisfies the conditions of Theorem 275, and has the 
boimd f w*. Hence J (H) has the bound w*, and Theorem 
294 follows from Theorem 293.* 


• All terms cancel except a number independent of K, 
■> The proof is that of Sohiu (1). 
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That A is botinded can be proved more simply: we give a niimber of 
proofs in Oh. IX. 

A is absolutely bounded (§ 8.8), since its coefficients are positive. It is 
important to observe that this is not true of B, To prove this, it is enough 
to prove that 

1*-Jl 

for a positive set (sc, y) for which and 2]y^* are convergent. '\^e take 

(logt)-i (i> 1), y,=j-i (log;)-* (j > 1) 
and a?i=:a? 2 , ^ 1 =^ 2 . Then 


LL' 


^iVi 






V i V 1 > i /** — T. ^ 

it=i fc i-t +1 i (log 1)* ~k^ihj it+i u (log u)* k=i fc log (fe + 1) ’ 
and this series is divergent. 

We shall see in Ch. IX that A is bounded in [p,p']. B is also bounded 
in [p<p'], but the proof is much more difficult: seeM. Riesz (1,2), Titch- 
marsh (2, 3). 


THE THEOREM OF M. BIESZ 

8.13. The convexity theorem for bilinear forms with 
complex variables and coefficients. We prove next a very 
important theorem due to M. Eiesz^^. This, like Theorem 285, 
asserts the convexity of log .If jj , where 3f p is the upper boimd 
of a form of the type A ; but in Eiesz’s theorem the form is 
bilinear, the a, x, y are general complex numbers, and convexity 
is proved only in a restricted domain of a and j8. 

It is essential to Eiesz’s argument that .Sfo,/} should be an 
attained maximum and not merely an upper bound; and we 
therefore consider a finite bilinear form 

m n 

(5.13.1) - - 2 S * 

<-i 

296e Suppose that is the maximum of A for 

(8.13.2) S|*,|V«^1, 

U being understood that, «/ a^O or /3»0, (hese inequalities are 

• M. Riesz (1). Our proof is sabstantially that of Riesz. An alternative proof has 
been given (not quite completely) by Paley (2, 4)* 
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replaced by j a:J g 1 or | i ^ 1, Then logM^j in convex in the 
triangle 

(8.13.3) Ogagl, a+jS^l. 

We have to prove that, if (ai, jSj) and (a^, are two points of 
the triangle (8.13.3), 0<<< 1, and 


(8.13.4) a = aii + a2(l-«), p = + ^^[1-f), 

then 

(8.13.6) 


We need the fact that as a->0. Let (x,y) be a 

maximal set for (a, /3), («o>yo) (0./3). Then (x,y) is 

a ‘permissible’ set for (0,j3) and (m-“xo,yo) for (a,j3). Hence 

and Also as 

]8->-0. There is thus some continuity of -Sfa,yy at the comers (0, 1) 
and (1, 0) of the triangle (8.13.3), at other points of which 
is obviously continuous. Hence, after Theorem 88, it is enough 
to prove that (8.13.6) is true, when Oi, aj, ^2 are given, for 
some t for which 0<f< 1. It is, moreover, enough to consider 
for a > 0, )3 > 0, in which case we can define p, g, p', g' by 


(8.13.6) a = 


P 


-+A = 1, -+-, = 1. 

^ g' P P ? ? 


We may then write (8.13.2) in the form 

(8.13.7) ®j,(a;)^l, ©^(ylSl; 

and the inequalities (8.13.3) are equivalent to either of 

(8.13.8) 
and 

(8.18.9) p’^g^l- 


We shall also write, as in § 8.7, 

(8.13.10) Z^=Z^(a;) = Sa<,®ii 

if 

so that 

(8.13.11) A=SX^y/=Sa;iri 
or simply 

(8.13.12) ^=SXy=S*r. 
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Theorem 286 enables us to give another definition of 
which is more convenient for our present pmpose. It is plain 
that A attains its maximmn for a set (x, y) in which 

(8.13.13) 

and p is the least number K satisfying 

(8.13.14) |4|<X@^(x)©„(y) 

for all such (x, y). Since both sides are homogeneous of decree 1 
in X and y, the restrictions (8.13.13) are now irrelevant, andylf;^^ 
may be defined as the least K satisfying (8.13.14) for all (x, y).» 
By Theorem 286, this is also the least K satisfying 

(8.13.15) 
for all X, or 


(8.13.16) 6p,(r)gZ©,(y) 
for all y. We may therefore define Mg^p by 

(8.13.17) M^p-Max -Max , 

the maxima being taken for all non-null sets x or y. 

8.14. Further properties of a maximal set (x, y). Suppose 
that (x*, y*) is a set of (x, y), subject to (8.13.7), for which | A | 
attains its maximum, and that X*, Y* are the corresponding 
values of X, Y. It is obvious (as we have observed already) that 

(8.14.1) ©,(x*)=l, ©,(y*)=l. 

Also, as in (8.7.6), 

(8.14.2) \A\£<B^{X)<S,(y), I A | g@,.(r)@^(x). 

There must be equality in each of (8.14.2) when x,y have the 
values X*, y*\ for otherwise we could increase | A | by leaving 
the X, X unaltered and changing the y, or by leaving the y, Y 
unaltered and changing the X. Hence 

Mg,p^\A{x*,y*) 1 =©^(X*)©,(y*) = ©,.(y*)@p(x*). 
Further, by Theorem 14, 

or 

(8.14.3) \Xf*\=u>\yi*\^-\ 

» Thift is merely a repetition of an argument used already in { 8.8. 
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where to is positive and independent of j\ and 

argX'^y* 

is independent of Hence 

M^p= I ^ (x*,y*) I = 1 ^X*y* 1 = S I X*y* [ = wS | y* |«=a.. 
Substituting in (8.14.3), and adding the corresponding result for 
Yf* , we obtain 

(8.14.4) 1 Z,* I = I y / j 7,* | = | l^-^. 


8.16. Proof of Theorem 295. In what follows we suppose 
(x, y) a maximal set (for the indices a, jS), omitting the asterii^. 
We write Pi=l/ai, and so aD.,&TLdLM,Mi,M^ioT 
^at, pf of p,pi , ... excludes the points (0, 1), (1, 0) of 

the triangle, but, as we remarked in § 8.13, this will not impair 
the proof. 

By (8.14.4) 

=(sirih')^i'x’=s^,.(r). 

Comparing this with (8.13.17), we obtain 

(8.16.1) Jf©gr-\)p.'(*) g 

Similarly 

(8.16.2) 

Hence, if 0 < t < 1, we have 


(8.16.3) 

Let us assume provisionally that there is some t between 0 and 


1 which satisfies 
(8.16.4) 


i = i Izl + 

P~Pi Pi’ y ii 3a ’ 


that is to say the equations (8.13.4), and that 
(8.16.6) @i7‘(a:)^@S=i!p,'(a5). (S/) ^ 

Then (8.16.3) and (8.16.6) will give 

for some t, and after Theorem 88 the theorem will follow. 
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It remains to justify the assumptions expressed by (8.16.4) 
and (8.15.6). Let us assume further that there are numbers ^ and 
V such that 

(8.16.6) 0<^^1, 0<t'^l, 

(8.16.7) 

Pi=(p-l)pi gi = (?-l)g8'v+g(l-v)l 

By Theorem 1 8,* r log @, (*) = log (x) is a convex fimctioV of r ; 

and the z, being a maximal set, satisfy (8.13.13). Hence \ 

(8.16.8) 
and similarly 

(8.16.9) s*:(j/)ge|«:ilS'’'(3/)®S'^~’'’fy)“®iri)S'‘'(J/)- 
If finally 

(8.16.10) ^=/-. 

P 2 1 — t Qi t 

then (8.16.8) and (8.16.9) will be equivalent to (8.16.6). 

In order to complete the proof, it is necessary to show that 
(8.16.4), (8.16.6), (8.16.7), and (8.16.10) are consistent. These 
conditions contain six equations to be satisfied by the four 
numbers p, q, p, v, and two inequalities. The first equation 

(8.16.10) gives 

Pi Pl^-^ 

and 

and the first equation (8.16.7) gives 

1 (Pi-l)fi+l _ Pat+Pi(l-t) __ t 1-t 
P Pt+PiP PiPt Pi P, ’ 

which agrees with (8.16.4). 

A similar argument applies to the equations involving q, so 
that (8.16.4) is a consequence of (8.16.7) and (8.16.10). Given 
Pit gi) Pat Qat t, we can find ft, v&om (8.16.10) and p, q firom 
(8.16.7), and these numbers will satisfy the six equations. 


* Strictly, l)y Theorem 18 leeteted ae Theorem 17 vae reeteted in Theorem 87. 
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It remains only to examine the inequalities (8.16.6). If ja and v 
satisfy (8.16.10), and 0<<< 1, the inequalities are equivalent to 

(8.16.11) |lg / <?L. 

l-t P2 

Since (a^, Px) and (ag, jSg) lie in the triangle (8.13.3), we have, by 
(8.13.8) and (8.13.9), 

and a fortiori 

P2 

We can therefore choose ^ so as to satisfy (8.15.11), and then all 
our conditions are satisfied. 

It will be observed that it is only in the last paragraph that we 
use the essential inequality a+ j8 ^ 1. When the form is positive, 
this inequality is irrelevant; log p is then, by Theorem 285, 
convex in the whole of the positive quadrant of (a, jS). 


8.16. Applications of the theorem of M. Riesz. (i) Theo- 
rem 295 is easily transformed into another theorem of very 
different appearance. 


296. Suppose that 

m 

(8.16.1) ^ a^jX^ (^ = 1, 2, .. . , 7i) 

and that M* y is the maximum of 

for lla;*lW“^l. 

Then log Jlf* y is convex in the triangle 

(8.16.2) ’ O^y^a^l. 

In fact, by (8.13.17), 

- - Tv/r 


Jlf*y-Max 


-- @„(a!) 


©l/a(^) 


a,y 


while 

Henoe 
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if y l/g'' s 1 — j8 ; and then the oonditionB (8.16.2) are equivalent 
to (8.13.8) or to (8.13.3). Thus is a convex function of 
(«, 1 — y), or, what is the same thing, of (a, y). 


(ii) 297. Suppose that Xf is defined by (8.16.1); that 


(8.16.3) 

for all x\ and that 

(8.16.4) 

Then 

(8.16.6) 

where 


n m 


lgjpg2. 




\ 


(8.16.6) m = Max | 1 . 

To deduce Theorem 297 from Theorem 296, we write a* 1/p, 
as before, and consider the line from (i, J) to (1, 0) in the plane 
(a, y). This line lies entirely in the triangle (8.16.2); and hence, 
by Theorem 296, 

for I < a< 1. It is plain from (8.16.3) that ^ ^ 1; and 


Max]^^ 
■“i-o S|a:| ='"• 


Hence Jf* 

which is equivalent to (8.16.5). 

The condition (8.16.3) is certainly satisfied (with equality) if 
(8.16.1) is a ‘unitary* substitution, i.e. a substitution which 
leaves S | a: |* unaltered*. This case of the theorem was found by 
F. Riesz (4), and the general theorem by M. Biesz (1). 


8.17. Applications to Fourier series. Out of many other 
important applications of Biesz’s theorems, we select an applica- 
tion of Theorem 297 to the proof of Hausdorff’s theorems’*. 


^ In thiB case n am. A real unitary substitution is orthogonal. 

b See §8.5. Riesz deduces these theorems in a different manner, and gives a number 

of other applications. 
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(i) Suppose that m is an odd integer and 

im 


/„(0)= S a 

-im 


The substitution is unitary, so that 

Also m = m~*. Hence, by Theorem 297, 

/ 1 I /27rv\ |»'\Wp' 

The left-hand side being an approximation to ^p {f), we may 
deduce Hausdorff’s theorem (8.6.6) by passages to the limit*>. 

(ii) If m is again an odd integer, 

(27rfi\ 


»i*=/m 


I W / 
1 


and 


im 

r=»— im 


im 

X„=-7- S 


then simple calculations show that 

and S|Z,ia=s|a;^|» 

as before. In this case Theorem 297 gives 

and Hausdorfi’s theorem (8.6.7) follows by appropriate passages 
to the limit. 

We can also, as we showed in § 8.6, deduce the second theorem 
from the first. 


• We now write ft, v for i^j, and extend the summations over the range 

— Jm <fi< im, 

^ If /(d) is a polynomial iM 

-iM 

then f„ {0) =/ (S) for t» g M, and the theorem for / (0) follows immediately from 
(8.17.1). The extension to on arbitrary f(0) depends on the theory of strong 
convergence*. 
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MISCELLANEOUS THEOREMS AND EXAMPLES 


298. If jp > and a (x, y) is measurable and positive, then the three 
assertions 


(i) j^a(x.y)f{x)g(y)dxdy^K » 

for all non-negative /, g; 

for all non-negative /; 

(iii) dx a(x.y)g(y)dyy 

for all non-negative g, are equivalent. The assertions ‘there is inequality 
in (i) unless / or g is null’, ‘there is inequality in (ii) unless / is null’, 
‘there is inequality in (iii) unless g is null*, are also equivalent. 

[Analogue of Theorems 286 and 287, with q There is a more general 

form with both p and q arbitrary.] 

299. The forms 


A = LS 




t+y- 1-f A’ 


B = SL' 


i-j + \ 


where A> 0, and the dash is required only if A is integral, are bounded in 
[2, 2], and have bounds ir if A is integral, n | coseo Air | if A is non-integral. 
[Schur (1), P61ya and Szego (1, i, 117, 290).] 


300. If p > 1 and A = has a bound M in [p, p'], and 

where / 1/< ^ jn*’, J | \^’dt g v^\ 

then A*^YLanhifXiyf 

has a bound Mfiv» 

[For I = I I 

gilf J{S I *,/,(«) I ytg,(l) 

g Jlf {JS ! !•> |/y(0 1'’*}’/' 1 g,{t) I*-'*}*"' 

SMMSklWSlyyK)'/''. 

For Che eaae see Sohur (1).] 


801. SZ' ^^}****. is bounded in [2, 2]. 


302. Xiyi is bounded in [2, 2], for any real $. IfOg^^ir, 

t-— ^ 

the bound does not exceed Max (^, tt— ^). 

[For the last two theorems see Schur (1).] 
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808. If is the Fourier sine coefficient of an odd bounded function, or 
the Fourier cosine coefficient of an even bounded function, then the forms 

are bounded m [2, 2]. 

[Toeplitz (1). Supx)Ose that i and J run from 1 to n; that x and y are 
real and v • , 

that JST = cos t^, F = cosy^, F^=2t/^siny^p 

and that (for example) 

2 /*«• 

an=- I f(0)8mned$, 

^Jo 

where |/ 1 . A simple calculation gives 

SSo^,a5,y,=- f\x'Y-XY')f(e)d 0 . 

11 ^Jo 


Since 


/: 


XY'f(B)de 




(X* + Y'^) de = i;Mn (Sr2 + Sy®) = tt, 


we find the upper bound 2M for A. Similarly in the other cases. 

If for example /(0) is odd and equal to J ( tt — 0) for 0 < ^ < v, then M=:i7r 
and =n“i. We thus obtain the result of Theorem 294 concerning BJ 

804. If (i) HHaifXiy^ is bounded in [p, g], (ii) Aj > 1, Z > 1, and (iii) (tij, 
(Vf) are given sets for which Sj,i,f{u)<co 9 6gr(v)<«>» 

A = 2Sa<^w<v^a;<2/i 

is botmded in [pA;, ql}, 

[For I A I g M (S 1 ux 1 vy 1«)V« 

^ M (E I u**” 1 )V«*'(E 1 »«»' I )V«''(S 1 X 1**)»/»»(S 1 y I")*/**.] 


885. The form 




where M| and Vf aie given sets of numbers satisfying 

is bounded, but not necessarily absolutely bounded, in [oo, oo]. 

[Take p = g = 2, ifc = 1 = 00 in Theorem 304. 

II the form were always absolutely bounded, then Hubert’s form B of 
§ 8.12 would be absolutely bounded, which is untrue.] 


806. If 3W,(o)g.4,H, 9W4 (o)^-^4H. 


then 

[By Theorems 16 and 17. The result is required in the proof of the 
next theorem.] 
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S07. If perpendiculars are drawn from the comers ( ± 1, ± 1, . , . , ± 1) of 
the *unit cube ’ in spaoe of m dimensionst upon any linear [m — 1] through 
the centre of the cube, then the mean of the perpendiculars lies between 
two constants A and JB, independent of m and the position of the [w— 1]. 

808. If 

then P = {SS|a^,|t)f<iC(S6, + Sc<)=iC(B4-C), 

where ^ is an absolute constant. 

\ 

809. A necessary condition that a form A = with\real or 

complex coeiEoients, should be boimded in [cx), oo], with bound M, is that, 
in the notation of the last theorem, P, C, P should be less than KM. 

[For the last five theorems see Littlewood (2).] 

810. If + 

pq^p^q* ^ 3pq — 2p-~2q* 
and A is bounded in [p, q], with bound M, then 

where and are defined as in Theorem 308, and K depends on p and 
^ only. 

Ml. If + 

but the conditions of Theorem 310 are satisfied in other respects, then 

812. If P<2<q, i+-<l, 

P 9 

and the conditions of Theorem 810 are satisfied otherwise, then 

818. If P>1, J>1, -+-<1, 

P 9 

A is bounded in Ip, g], mth bound M, and 

then 

[For the last four theorems see Hardy and Littlewood (13).} 
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814. Hilbert' 8 forma in [p, p']. It will be proved in Ch. IX that the 
form A of Theorem 294 is bounded in [p, p']. The corresponding theorem 
for B lies a good deal deeper. We have to show that 

(i) ^Ke,{^)eAy). 

1 1 

where K^K{p) depends only on p; or, what is, after Theorem 286, the 
same thing, that 

(ii) 

j ^ J 

It is enough, after Theorem 295, to prove (i) or (ii) for even integral values 
of p' (or for some subsequence of these values). This demands some 
special device, the most natural, from our present point of view, being 
that used by Titchmarsh (2). 


HI 


15 



CHAPTER IX 


HILBERT’S INEQUALITY AND ITS | 
ANALOGUES AND EXTENSIONS 


9.1. Hilbert’s double series theorem. The researches of 
which we give an account in this chapter originate in aremai^kable 
bilinear form which was first studied by Hilbert, and whi^h we 
haye already encountered in § 8.12, viz. the form 


SS 


m+n 


> 


where m and n run from 1 tooo. Ourfirst theorem isTheorem 31 5 
below, which we state with its integral analogue and with a com- 
plement of a type which will occur frequently in this chapter. 


315. If p>l, p'=pl{p-l), 

and 


SS ^ . 7 / 
m+n 8m(w/p) 


(9.1.1) 

unhsB (a) or (6) is nvU. 

816. If p>\, p'=pl{p-l), 

and 
then 

(9.1.2) 




f fP(z)dx^F, f g»'[y)dy^O, 

Jo Jo 

FUpffiip'^ 


0 z+y ^ sin(v/p) 


unless fs OorgsO. 

317. The constant ircoseo {irlp) is the best possible constant in 
each of Theorems 315 and 316. 

The oaaepsp' = 2 of Theorem 315 is 'Hilbert’s double series theorem’, 
imH was proved first (apart from the exact determioation of the oonstaut) 
by Hilbert in his lectures on integral equations. Hilbert’s proof was 
published by Weyl (2). The determination of the constant, and the 
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integral analogue, are due to Schur ( 1 ). and the extensions to general p 
to Hardy and M. Riesz: see Hardy ( 3 ). Other proofs, of the whole or 
of parts of the theorems, and generalisations in different directions, 
have been given by rej6r and F. Riesz ( 1 ), Francis and Littlewood ( 1 ). 
Hardy (2), Hardy, Littlewood, and P61ya ( 1 ), Mulholland ( 1 , 3 ), Owen ( 1 ), 
P61yaandSzego(l,i, 117,290), Schur(l), and F. Wiener ( 1 ). Anumber 
of these generalisations will be proved or quoted later. 

The inequality (9.1.1) is of the same type as the general inequality 
discussed in § 8.2; but Theorem 315 is not included in Theorem 275, since 

m m + n 

is divergent. It is to be observed that tt cosec arr, where a = 1/p, is (in 
accordance with Theorem 295) convex for 0<a< 1. 


9.2. A general class of bilinear forms. We shall deduce 
Theorem 315 from the following more general theorem a-. 

318. Suppose that p > 1, p' —pj{p ~ 1), and that K (x, y) has the 
following properties: 

(i) K is non-negative^ and homogeneous of degree — 1 : 


(ii) 


r K{x,l)xr^Pdx=‘{ K(l,y)y-^^'dy=k-. 
Jo Jo 


and either (iii) K(x,\) x~^^ is a strictly decreasing function of x, 
andK {l,y)y~^^'ofy: or, Tnoregenerally,{m’)K{x,l)x~^^decrea8e8 
from x=l onwards, while the interval (0, 1) can be divided into two 
parts, (0, and (^, 1), of which one may be nvl, in the first of which 
it decreases and in the second of which it increases ; and K ( 1 , y) y~^ 
has similar properties. Finally suppose that, when only the less 
stringent condition (iii') is satisfied. 


(iv) 

Then 

(a) 


K(x,x) = 0. 


SSX (m, n) < h 


unless (a) or (6) is null; 


(b) 


S (sx (m, n) o 

n \m f 


urdess (a) is nuXl\ 

• Hardy, Littlewood, and P61ya (1). The °f the tteorem « duejn 

stance to Schur ( 1 ) : Schur supposes K {x, y) a decreasing 

x5-a 
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S {lK (m, n) bX < 

Tit \n / 


(c) 

(6) is null. 

In each case the summations are from 1 to oo. Theoreihs 286 
and 287 show that the three conclusions (a), (b), (c) are equivalent. 

We may elucidate the hypotheses by the following remarks. \ 

(1) The convergence and equality of the two integrals in condition (ii) 

is a consequence of the convergence of either, because of the homogeneity 
of is:. ' 

(2) The words ‘decreasing*, ... are to be interpreted in the strict sense 
throughout the theorem. 

(3) f may be 0 or 1, one of the intervals (0, and (£, 1) then disappearing. 

(4) In the most important application, in which 

K (x, y) = — ^ , 

' x-^y 

condition (iii) is satisfied. An interesting case in which condition (iii') is 
required is 

In this case iC (a?, 1) has an infinity at a;=: 1. In such cases condition (iv) is 
required in order to exclude equal pairs (m, m) from the summation. 

It is easy to see that, if m and n are positive integers, and the 
summations are over r= 1, 2, ..., then 

( 6 . 2 . 1 ) 


For, if (iii) is satisfied, then 


^(4 ‘IK 




n 


K{x, 


J (r~l)/n 

and (9.2.1) follows by summation. If only (iii') is satisfied, we 
use (9.2.3) for r > n and for r < and 

/t \ 1 r(»*+i)/n. 


• See Hardy, Littlewood, and P61ya (t). 
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for ^n<r<n', and the result again follows by summation when 
we observe that Z(l, 1) = 0. The proof of (9.2.2) is similar. 
Hence 


g pvp Qyp\ 


where P='S:aJ’^K{m,n)(-Y'’ 

tn n \^ / 

n \ m}\m) 


= 2a 


m 




by (9.2.2), unless (o) is null; and similarly 


Q<*S5„p' 

unless (6) is null. This proves the theorem. 

If we take Z(a;,w) = — 

x + y 

we obtain Theorem 315. It may be shown that the h of Theorem 
318 is a best possible constant, but in this direction we shall not 
go beyond proving Theorem 317.* 

9.3. The corresponding theorem for Integrals. The 
theorem for integrals corresponding to Theorem 318 is 

319. Suppose that p > 1 , that K{x,y) is non-negative and homo- 
geneous of degree — 1, and that 

r K{x,l)x-vpdx=^ r K{\,y)y-^^'dy=h. 

Jo Jo 

Then (a) 

/•« /•“ / r* \vp I t'o ,, \vp' 

jJ^K(x,y)f(x)g{y)dxdyik\^jJ^dxj [j dyj . 

(6) J J dy ( J J K (X, y)f{x) dxj < IcP jj^dx, 

(c) jyx(^j"K(x,y)g{y)dyJ ^kp’j'g^'dy. 

If K (a;, y) is positive, then there is ineq'mlity in (b) unless /s 0, 
in (c) unless g^O, and in (a) unless either /s Oorg^O, 

• See § 9.6. The constant k is (again in accordance with Theorem 295) convex 
in asr 1/p. 
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The theorem may he proved by the method of |9.2, which 
naturally goes rather more simply in this case. We have 

J J -S' (*, y)f{x) g (y) dxdy 

fp(x)dxj K{x,y) 



If Jf > 0, and there is equality, then 

(9.3.1) AfP (x) [ly’ ^SgP' (y) 

for ahnost all t/.«- If we give y a value for which g (y) is positive and 
finite, and for which the equivalence holds, we see that (a?) is 
equivalent to a function Cxr^y and this is inconsistent with the 
convergence of Ipdx. Hence either / or g is null. It may also be 
shown that the constant is the best possible. 

There is another interesting method of proof due to Sohur^. 
We have 


where 





/•oo /•» ^00 

f{x)dx K{x,y)g{y)dy= f{x)dx xK(x,xw)g{xw)dw 
Jo Jo Jo Jo 

Too /•« rco roo 

= f{x)dx\ K{l,w)g(xw)dw= \ K{\,w)dw\ f(x)g(xw)dx 
Jo Jo Jo Jo 


(if any of the integrals is convergent). Applying Theorem 189 to 
the inner integral, and observing that 


jgp'(xw)dx=^j 




we obtain (a); and (b) and (e) are oorolhuies, by Theorem 191 . 

The case K(x,y)==l/(x+y) gives Theorem 316. We shall 
return later (§9.9) to other applications. 


* That is to saj it is trne^ for almost all y, that the two sides of (9.3.1) are 
equal for almost all x. See § 6.3(d). 

8chur(l). Schur supposes pss 2. 
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9.4. Extensions of Theorems 318 and 319. (1) The follow- 
ing theorem is in some ways more and in others less general than 
Theorem 318. 


320. Suppose that K (a;, y) is a strictly decreasing function of x 
and y, and also satisfies the conditions (i) and (ii) of Theorem 318; 
that A„| > 0, > 0, 

Aot = Ai + A2+ ... + A^, M^ = /Xi 4-^X2+ 
and that p>l. Then 

ZY.K (A„, M„) < k 

urdeas (a) or (b) is null^. 

The special case A„=m, M„=to is also a special case of 
Theorem 318. 

We deduce Theorem 320 from Theorem 319 by a process which 
has many applications'*. We interpret A® and Mq as 0, and take, 
in Theorem 319, 

f(x) = Kr^^<^m (A„_iga;<A„,), 

If we observe that 


J Am— 1 J Mfi— 1 

unless 0 or 6^= 0, we obtain Theorem 320. 

If Z (a;,y)= ll{x+y) we obtain® 

unless (a) or (b) is null. 

(2) Theorems 318 and 319 may be extended to multiple series 
or integrals of any order. 

ggy.g Swppoae that the n nwmbers p, q, r satisfy 

1 1 .l_i. 

p>l, g>l, —f ’ 


t 

0 

4 


}r the case p=2 see Schar (1). 
r S 6 4: and see, for example, § y.ii* 

ortlieoMei)=4=-='^*®*®°““' ’ 
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thit K (x,y,...yZ) is a positive function of the n variables a:, y, . . . , 2 , 
homogeneous of degree ~ + 1 ; and that 

poo poo 1 

(9.4.1) ... K{l,y,...,z)y i...z rdy...dz=k. . 

J 0 J 0 I 

Then 

^00 1*00 ^00 

... K{x,y, ...,z)f{x)g{y) ...h(z)dxdy ... dz 
J 0 J 0 Jo 

a <x> \Vp / poo \llq / poo \l 

JPdx) 

If further 

y-^«...z-^K{l,y z), x-^'p ...z-^K{x,l,...,z),... 

are decreasing functions of all the variables which they involve, then 
SS ... UK {m, n, ...,«)a„6„ ... c.^h (S6„9)V« ... (Sc/)^-. 

In virtue of the homogeneity of K, the convergence of (9.4.1) 
implies the convergence and equality of all tibe n integrals of the 
same type. 

Theorem 322 may be proved by straightforward generalisations 
of the proofe of Theorems 318 and 319. 

9.5. Best possible constants : proof of Theorem 317. We 

have still to prove Theorem 317, which asserts that the constant 
wcosec (tt/p) of Theorems 315 and 316 is ‘the best possible’, that 
is to say that the inequalities asserted by the theorems would be 
false, for some a^, or /(*), giy), if wcosec (v/p) were replaced 
by any smaller number. The method which we use illustrates an 
important general principle and may be used in the proof of many 
theorems of this ‘negative’ character. 

We take 

where e is small and positive; we may suppose that e<p 72 p. 
We denote by 0 (1) a number which may depend upon p and e, 
but is bounded when p is fixed and 6-»-0; and by o (1) a number 
which satisfies these conditions and tends to zero with e. Then 

1 r* ju 

-= I d* < < 1 + xr^~*dx=l+-, 

e Ji 1 Ji c 

and so 

(9.6.1) Sa*;==Sm-i-‘=-* + 0(l), UbP'=-^ + 0{l). 
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Also 

m + n J 1 J 1 x-^y 

/•go /•oo 

Jl Jl/x 1 + ^ 

The error in replacing the lower limit in the inner integral by 
0 is less than a;‘”“/a, where a is positive and independent of and 


Hence 


1 p 

aj 1 




(9.6.2) SS^^> f 

m+n Jl J( 


^ /I'll 

1+?^ 
'IT 


= +o(l)l + 0(l) = M^-^^^ + o(l)|. 

e\Bin{irlp) ') e {sin(jT/p) ) 

It is plain from (9.5.1) and (9.5.2) that, if k is any number less 
than ircoseo (nip), then 

S2 > Ic (SaP)i/P 
m-\-n 


when € is sufficiently small. 

This proves that the constant in (9.1.1) is the best possible. 
Since (9.1.1) can be deduced from (9.1.2) as in §9.4, it follows 
that the constant in (9.1.2) is also the best possible. We could of 
course also prove this directly. 

An alternative method is to take 

when ngp, and a„=0, 6„=0 otherwise, and to make m 

tend to infinity. The principle is the same in each case; we make 
o_ and depend upon a parameter (e or p) in such a manner 
that the series involved tend to infinity when the parametertends 


ft It is less than 


wl^ere 


/ 


l/x 

^•~ii+9)IP'du = 

0 


T' 


i8«l- 


1+c 

p' 


1 

p 


€ 



and we may take a* l/2p, if c <p* ftp* 
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to a limit, and compare their values for values of the parameter 
near this limit. The method is effective in the proof of very 
many theorems of the type of Theorem 317. The inequalities 
(9.1.1 ) and (9.1.2) assert unattained upper bounds; except when 
both sides vanish, equality cannot occur; and it is for this r^on 
that the introduction of a parameter (e or n) is necessary ip the 
proof of the complementary theorem. 


9.6. Further remarks on Hilbert’s theorems*^. Theo^ms 
315 and 316 have been proved in many different wa3rs and mve 
very varied applications. In this and the next section we collect a 
number of remarks which concern both proofs and applications, 
and are intended to illustrate the connections between the 
theorems and various parts of the theory of fxmctions. 

(1) Theorem 315 may be deduced from Theorem 316 by the 
process which led us to Theorem 321. Wedefine/(x) and g{y) by 
f{x)-a^ {m-l^x<m), 9{y) = bn (n-l^x<n), 
and observe that then 

■ p 

-iJn-i *+» ^ m+n 

Here, however, we can go a little farther, since 

m+n— 1— a m+n—l+x m+n— 1 
for 0 < a < 1, and so** 

r™ dxdy 1 

Jw-Jn-i x+y^m + n-1' 

If now we replace m and n by m+ 1 and n+ 1, we obtain a 
slightly sharper form of Theorem 315, viz. 


328. If the conditions of Theorem 315 are satisfied, then 


2 2 ^ 

0 0 m+n+l 


< 


TT 

sin {trip) 



* We desoribe Theorems 315 (together with the sharper Theorem 323) and 316 
as * Hilbert’s theorems’. Strictly, Hilbert’s theorem is Theorem 315, with p>-i2. 

^ Associate elements of the integral symmetrically situated about the centre of 
the square of integration* 
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Several other proofs of Hilbert’s theorem, for example the proof of 
Mulholland (1), the proof of Schur given in § 8. 12, the proof of Fej^r and 
F. Riesz given below, and the proof of P61ya and Szego (1, i, 290), also 
give the result in this form. The last three are limited to the case p = 2. 

(2) The proof of Fej6r and F. Biesz is based upon the theory 
of analytic functions, and proceeds as follows. Suppose that 
/(z) = Sa^ 2 ;^ is a polynomial of degree N with non-negative co- 
efficients, not identically zero. Then, by Cauchy’s Theorem, 


J f^(x)dx= —i j P (e^^) ef'^d6\ 

and so 

(9.6.1) f P{x)dx< I* /2(a:)da;gjf \f(e^)\’^dd, 
Jo J -1 J 


2 2 

0 fl tn-i-n + 1 


N 

: IT Sa$ . 


If we N-*co we obtain Hilbert’s theorem, with a^’=b„ 

and ‘ ^ ’ for ‘ < ’. The first restriction is unimportant, since, 
after § 8.8, a symmetric bilinear form in [2, 2] has a bound equal 
to that of the corresponding quadratic form. To replace * g ’ by ‘ < ’ 
requires a refinement of the argument which we shall not discuss 
here. 

The second inequality in (9.6.1) may be written 

^\f[x)\^dxi^j''_U(e^^)\^d9, 

and in this form it is valid whether the coefiBcients a„ are real or 
complex, and has important function-theoretic applications®. 

(3) Hilbert’s original proof depended upon the identity 

(9.6.2) j'' t |s( — l)’'(a,cosrt— br8Uirt)| dt=2ir(S— T), 

where ^-,ir+s’ TT 

(the dash implying that pairs r, a for which r=« are omitted). From this 
it follows that a » » . 

(9.6.3) 2.,lS-!r|^e-|'_^{s(-l)'(o,oosrt-&,emr<)[ *=**S(a*+6r)- 

. See Fej^r and F. Riesa (1). The 
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If 0,= 6ri T disappeaxs, and we obtain 

(9.6.4) 

and from (9.6.4), and the remark of § 8.8 quoted under (2), we deduce 
(9.6.6) + 

From (9.6.3) and (9.6.6) it follows that 

and henoe, on grounds of homogeneity, that 

This gives the second resiilt of Theorem 294, except that the constant 2ir 
is not the best possible constant. 


9.7. Applications of Hilbert’s theorems. (l)ABanapplica- 
tion of Hilbert’s theorem to the theory of analytic functions we 
select the following. Suppose that /(z) is regular in ] z | < 1 and 
belongs to the ‘complex Lebesgue class L\ i.e. that 

is bounded for r< 1. If/(z) is ‘wurzelfrei’, i.e. has no zeros in 
|z| < 1, then y(j;)^ 2 c„ 2 »=sr 2 (z) = (Sc„z")®, 

where g(z) also is regular in |zl<l. Since i\g{re*^)\^dd is 
bounded, S lo„ |* is convergent, and therefore, by Theorem 323, 

2j«ml 

TO + »+ 1 

is convergent. A fortiori 



2 


is convergent. 

It is fairly easy, by a method which is familiar in this part of 
the theory of analytic functions, to extend the conclusion to 
general / (not necessarily ‘wurzelfirei’)*. We thus obtain the 


• See F. Biesz (3), Hardy and littlewood (2). We can ezpreaa / as the euin of 
two ‘wonelfrei’ functions of L. 
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theorem: if f(z) belongs to L in 1 2 1 < 1, then its integrated ptrwer 
series is absolutely convergent for j 2 j = l.a 

(2) As an application of Hilbert’s series theorem to the theory 
of functions of a real variable, we prove 

324.t> Iff(x) is real, and not null, in (0, 1), and 
an= (n = 0, 1,2, ...), 



The constant is the best possible. 

We may plainly suppose f{x)^0. Then, if (bj is any non- 
negative and non-null sequence, 

^a„b„=i:b„j^x'‘f(x)dx=j^ (n>^x’^)f{x)dx, 

C {^b„x-)^dxl' fHx)dx 
Jo Jo 

by Hilbert’s theorem. The result now follows from Theorem 15. 

To prove the constant w the best possible, consider 

f(x) = (l-xy-i, 
and make c tend to zero. 

The integrals a^ are called the moments of f{x) in (0, 1) and are 
important in many theories. 

Here we have deduced Theorem 324 from Theorem 323 (with 
p = 2) and Theorem 15 (the converse of Holder’s inequality). We 
can, if we please, reverse the argument, deducing Theorem 323 
(with p=2) from Theorem 324 and Theorem 191 (the integral 
analogue of Theorem 15). Suppose that g (x) = where b^ is 

* Hardy and Littlowood (2). The theorem may also be stated in the form *if a 
power series is of bounded variation in |2l < 1, then 2|&nl » conver- 

gent’. For this form of the theorem, and for more precise results, see Fej^r (1). 

^ A much more general inequality, but without the best possible value of the 
constant, is proved by Hardy and Littlewood (1). See also Hardy (10). 
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non-negatiye and not always 0, and that /(a;) is any non-negative 
and non-null function. Then 

r fgdx= f (S6„a:’*)/da:=26„ f x‘^fdx=lM„b„, 

Jo Jo Jo 

by Theorem 324. Since this is true for all /, it follow^, by 
Theorem 191, that \ 


j: 


g^dxKnlib^^; 


\ 


and this is equivalent to Theorem 323. 

It is plain that when two inequalities, each involving a con- 
stant factor, are ‘reciprocal’ in this sense, each being deducible 
from the other in this way by the converse of Holder’s inequality, 
then one constant must be best possible if the other is. We shall 
meet with another application of this principle later (§ 9.10 (1)). 

(3) As a corollary of Theorem 316 (with p = 2), we prove 

325.^ Suppose that and that the summations run from 0 toco, and 
that 

(9.7.1) 

Then 


A(x) = 'E^„x\ 


(9.7.2) 

(9.7.3) 


SL 


m+n+ 1 ” 


TT S S 


{rn-\‘n)\ a^ a^ 
m\n\ 2 ’"+"+!' 


r A^{x)dx^'n f {e-‘^A*{x)ydx. 

Jo Jo 


It may be verified at once by expansion and term-by-term integration 
that (9.7.2) and (9.7.3) are equivalent. 

To prove (9.7.3) we observe that 


A(x)=f e-'*A^(xt)dt=- f e^f*A*(u)du 
Jo ^Jo 

p A^(x)dx=: e’~^l^A*{u)di^ =j^ 


and so 


where 


a(w) = e'^il*(w). 

• Widder (1), Hardy (9). 
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This is 

j dwj e”"***® a {u) du j 0 L{v)dv=:j j dudv 

a® (w) dw = TT^ {er^A* (u)Y du, 

by Theorem 316. 

It is easy to see that the constant n is the best possible. The relations 
between the functions (9.7.1) are important in the theory of divergent 
series, particularly in connection with singularities of analytic functions. 


9.8. Hardy’s inequality. The two theorems which we discuss 
next were discovered in the course of attempts to simplify the 
proofs then known of Hilbert ^s theorems^. 

We might require only an imperfect form of Theorem 315: the 
double series is convergent whenever and are convergent. It 
would then be natural to argue as follows. We divide the double 
series into two parts 8^ by the diagonal m=r^, and consider 
the part 8^ in which m^n. Then 




^ 2 s S — 6, 


m^n rri’^n 




where 

Since is convergent, the last series is convergent whenever 
is convergent, and, to prove the convergence of 8i, it 
is enough to prove that the convergence of the last series is a 
consequence of that of SgJ. The convergence oi 8^ could then be 
proved in the same way. 

This line of argument leads up to and is completed by the 
following theorem. 


826. //^?> 1, + + 



uriUsB all the a are zero. The constant is the best possible. 


• It ww> » ooMiderable time before any really rimple proof of Hilbert’s double 
Beries theorem was found. 
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The oorresponding theorem for integrals is 

827 .» Ifp>l,f{x)'^Q,aniF(z) = ^ f{t)dt,^n 

unless 0. The constant is the best possible. 

These theorems were first proved by Hardy (2), except that Hardy 
was unable to fix the constant in Theorem 326. This imperfectic^n was 
removed by Landau (4). A great many alternative proofs of the the^rexns 
have been given by various writers, for example by Broadbent (1), 
Elliott (1), Grandjot (1), Hardy (4), Kaluza and Szego (1), Knopp (1). 
We begin by giving Elliott’s proof of Theorem 326 and Hardy’s proof 
of Theorem 327.** 


(i) In proving Theorem 326 we may suppose that Ox > 0. For 
if we suppose that 0, and replace by 6^, (9.8.1) becomes 



(V+V+— ). 


an inequality weaker than (9.8.1) itself. 

Let us write for AJn, and agree that any number with 
sufiix 0 is 0. We have then 


“n® - - (» - 1 ) a„_i} 

s «.* (i -^) {(? - 1) «."+ 

^ n ^ No. P 

Hence 

1 P^^ 


* We have already encountered thfs theorem in Ch. VII, but the proof which we 
gave there (in detail only when p==2) was intended primarily as an illustration of 
variational methods and has no particular pretensions to simplicity. 

^ We have expanded the proofs so as to deal with the question of equality, 
c By Theorem 9« 
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and therefore, by Theorem 13, 

(9.8.3) ^ 

Dividing by the last factor on the right (which is certainly 
positive), and raising the result to the pth power, we obtain 

N / 7) \P^ 

(9.8.4) ?“»’• 

When we make N tend to infinity we obtain (9.8.1), except that 
we have ‘ ^ ’ in place of ‘ < ’. In particular we see that ’Zo^p is 
finite. 

Returning to (9.8.3), and replacing N by oo, we obtain 

(9.8.6) Za^P g ^ g ^ (Sa„i>)Vi> (Zcc^p)Vp\ 

There is inequality in the second place unless (a^^) and (a,^^) 
are proportional, i.e. unless a^ = C(x,^, where G is independent 
of n. If this is so then (since ai = ai> 0) (7 must be 1, and then 
for aU n. This is only possible if all the a are equal, 
and this is inconsistent with the convergence of Hence 


(9.8.6) Za„P < ^ {Za^^)^p (Zx^P)^iP ' ; 

and (9,8.1) follows from (9.8.6) as (9.8.4) followed from (9.8.3). 
To prove the constant factor the best possible, we take 

{nSN), an=0 (n>N). 

Then = 

1 ^ 


A^=Zv-Vp> j’'x-VPdx=^^-{vfP-^-l] {n£N), 


where ->-0 when n->oo. It follows that 
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where tjjf-* 0 when N~*-co. Hence any inequality of the type 

is false if is chosen as above and N is suffidently large. 

An alternative procedure is to take for all », and 

to make e small. Compare § 9.5: it was this procedure which we 

followed there. \ 

(ii) We may suppose / is not null. Let n > 0, /„ =Min (/, n), 

f* \ 

= J /„d», and let Xq be so large that f, and so /„, are not 

null in (0, X) when X > X^. We have 




sinoe the integrated term vanishes at *=0 in virtue of F^=o{x). 
Hence 

The left-hand side being positive (and finite), this gives 




We make n->oo in this, the result being to suppress the two 
suffixes «; now making X ->• oo, we have 


/.■©•-‘WT"' 


the desired result with for Making n-»-oo and then 
X->oo in (9.8.7) we have 
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The integrals in this being now known to be all finite and positive, 
(9.8.8) gives 



unless x-PF^ and /» are eflfectively proportional, which is im- 
possible, since it would make/ a power of x, and Jf^dx diver- 
gent. 

The proof that the constant is the best possilble follows the 
same lines as before; take/(a;) = Ofora:< = for 1 . 

Elliott’s proof of Theorem 326 applies to Theorem 327 also, with the 
obvious modifications. The proof of Theorem 327 given in (ii) may be 
adapted to series, but does not give the best possible value of the constant. 

(iii) The following proof of Theorem 327 (due to Ingham) is also 
interesting: we shall be content with proving the form with ‘ ^ ’. We use 
Theorem 203, supposing that the intervals of integration are each (0, 1), 
that the weight-functions are 1, and that 

f=:l, «=p>l, f{x,y)=f(xy). 

Then = fj{xi/)dx=^^K 

for a; ^ 1. Hence, by Theorem 203, 

We then obtain the result by putting 

a;=X/c, f{Xlc) = g(X), 
replacing X, g by a;, /, and making o cx). 

9.9. Further integral inequalities. There are many ana- 
logues and extensions of Theorems 326 and 327, which have been 
proved by different writers in different ways; and we give some 
of these theorems here. We consider integral inequalities first, 
since we can derive most of these in a simple and uniform manner 
from Theorem 319, and the corresponding theorems for series 
sometimes involve slight additional complications. 

(1) Take, in Theorem 319, 

K{x,y)^\ly (x^yl K{x,y)^0 (x>y). 
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Then, if 3 )> 1, 

k={ K{x,l)xr^Pdx= { xr^dx^-^—, 

Jo Jo p—i 

and all the conditions on K axe satisfied. Hence (6) and (c) of 

Theoxem 319 give 

and 

Of these inequalities, (0.9. 1 ) is Theorem 327, with ‘ g ’ for ‘ ; 

we cannot quote * < ’ from the general theorem because K is 
not always positive. But equality in (9.9.1), with a non-null/, 
involves 

jjK(x,y)f(x)g(y)dxdy=^^(^ jfPdi^ {^^gP'd^'^ , 

with non-nuD / and g. The argument of §9.3 then shows that 
(9.3.1) is true for x<y, and that for small x, which is 

inconsistent with the convergence of If^dx. 

Similarly we can prove that there is inequality in (9.9.2) unless 
g is null. A trivial transformation then gives 
828. // p > 1 and 

F{x)=jy(t)dt, 

ihm 

(9.9.3) IJ FPdxKpP (xf)Pdx, 

unless fs 0. The constant is the best possible. 

(2) More generally, take 

•g(g>y)=r^ ^ (^<y)> F(x,y)=n0 (x^y) 

with r > 0. With r= 1, we come back to (1). We now have 
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We are thus led to 

329. If p>\yr>(i, and 


(9.9.4) = 


(9.9.6) 




■^—4:' f /•&. 

r+l-i|J 


unless f^O. If 

(9.9.6) /r(*) = i^Jj<-a:)’-V(0<i<, 


(9.9.7) 


f- f ''(?) I’f 

fPdx<i—^\ r 


(3ff)^dx, 


unless fs 0. In each case the constant is the best possible. 


The function /, (x) of (9.9.4)i8 the ‘Biemann-LiouTillemtegral’* 
oif{x) of order r, with ‘origin’ 0. The function (9.9.6) is the ‘ Weyl 
integral’ of order r, which is in some ways more convenient, 
especially in the theory of Fourier series. 


(3) Take 

K{x,y) = ^ (x^y), K(x,y) = 0 (x>y), 
with a< 1/p'. Then 

fc= / <!*=-—— — i . 

Jn p-pa-1 

and (b) and (e) of Theorem 319 give 

(9.9.8) »»•(-« (^j*xr*f(x) ^^^y<{ p_^p Z - i ) 

(9.9.9) 

Changing the notation, we obtain 
880. > 1, r #: 1, and F (x) w defined by 

F(x)=: jj{t)dt (r> 1), F(*)=JV(0* (»•< 1)» 

• See § 10.17. Part of Theorem 329 is proved by Knopp (3). 
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then 

(9.9.10) 1*01-^ (a^)» das, 

urdeeafsO.* 

The coziatant is the best possible. It is also easy to verify that, when 
pss 1, the two sides of (9.9.10) are equal. j 

9.10. Further theorems concerning series. Amons the 
analogues and extensions of Theorem 326 we select the follot^g. 

(1) The theorem related to Theorem 326 as Theorem 328 Is to 
Theorem 327 is '■ 


881.*> If p> I, then 

s K + o»+i +...)*’< J?** S (na^lP, 
unless (oj is null. The constant is the best possible. 

This theorem is ‘reciprocal’ to Theorem 326 in the sense of 
§ 9.7 (2), i.e. deducible from the latter theorem by the converse 
of Holder’s inequality. It may be instructive to set out the 
proof in detail, although what we say amounts to a repetition in 
a special case of what we have explained more generally before 
If K {x, y) is defined as in § 9.9 (1), then 


(9.10.1) 

'L'LK{m,n)aJ}^= = 

msn » n ” » ” 


by Theorems 13 and 326, unless (a) or (6) is nul. 
On the other hand 


and the maximum of this, for all (a) for which 
Theorem 15, 



m+1 


n w 

■ 


1, is, by 


• For a direct proof see Hardy (5), 
b Ck>p8on (1); Bee also Hardy (6). 

• See § 8.7. 
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Hence, by (9.10.1), 


fv I 
1 \m'^m+l 







Changing into ma^, and p' into p, vre obtain Theorem 331 . 

That the constant is the best possible follows from the 
last remark of § 9.7 (2). 


(2) 832. J/p > 1, o„ ^ 0. A„ > 0, a«d 

■^n + ^2 ■t' • • • + A,, , A.^ ~ + A2flt2 + . . . + A„(Z„ , 


then SA„ 

unless (<i„) is null. 





This theorem, which is related toTheorem 326 as Theorem 321 
is related to Theorem 315,maybeprovedin variouswajrs. In the 
first place, it may be deduced fromTheorem 320 by a specialisation 
of K (as Theorem 327 was deduced from Theorem 319 in § 9.9); 
but the question of possible equality then needs a little attention. 
Perhaps the simplest proof is by a direct adaptation of Elliott’s 
argument in § 9.8, If a„ = , we find that 

jA,***!** j *»*’)» 

and the proof may be completed as in § 9.8.^ 

The theorem may also be deduced from Theorem 327 by taking 
/ to be an appropriate step-function (the process by which, in 
§ 9.4, we deduced Theorem 320 firom Theorem 319). We shall not 
set this out in detail'* ; but the remark raises questions of which 
we say something more in the next section. 


9.11. Deduction of theorems on series from theorems 
on integrals 0. The process of deduction just referred to, and 
actually used in §9.4, is very natural and often effective. It is 
however apt to lead to difficulties of detail, so that direct methods 
are usually preferable. We illustrate this by giving a deduction of 

• For the details 26# Cop80Ji (1). 

» For the (which ere rather tronbleaome) aee Hardy (4). 

* ComjMie $§ 6.4 and 9.4 (1). 
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Theorem 326 firom Theorem 327, which leads us incidentally to 
a remark of considerable intrinsic interest. 

We observe first that it is efficient to 'prove Theorem 326 on the 
hypothesis that a„ decreases as n increases. This follows from a 
theorem which is of sufficient interest to be stated separajbely. 

833. If the a„ are given except in arrangement, and is a 
positive increasing function of u, then 




is greatest when the are arranged in decreasing order. 

To prove Theorem 333 we have only to observe that, if v>fi 
and a^>a^, the effect of exchanging and a^ is to leave -4^ 
unchanged when n < /x orn ^ v, and to increase when /x ^ w < v. 
The theorem is one of a type which we shall discuss in much 
greater detail in Ch. X. 

Suppose now that a^ decreases, and that Theorem 327 has 
been proved. We define f{x) by 

Then 




(9.11.1) 2a„ 

Ifn<a;<n+1, then 

J’(a;) Oi+Oa +... + «„ + (a;- n)o„+i _ .d„-na„+i + a!g„+i 


X 


X 

^»-«a»+iSO, 


X 


and 

so that F/x decreases from AJn to A„+y{n + 1) when x increases 
firom » to »+l. Hence 


and so 
(9.11.2) 


— > 

X “n+ 1 


(n<x<n+l) 




Theorem 326 now follows firom (9.11.1), (9.11.2), and Theorem 
333. 
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If the reader will try to deduce Theorem 331 from Theorem 
328 similarly, he will find some difficulty. Something is lost in the 
passage from integrals to series, and it is by no means alwa 3 rs 
that (as here) the passage can be made without damage to the 
final result. 


9.12. Garleman’s inequality. If in Theorem 326 we write 
for a^, we obtain 

If we make p->oo, and use Theorem 3, we obtain 
S{aia2...aJV^^eSa^; 

and this suggests the more complete theorem which follows. 


3S4.a S < e Sa^, 

unless (a^) is nulL The constant is the best possible. 

It is natural to attempt to prove the complete theorem by 
means of Theorem 9; but a direct application of Theorem 9 to 
the left-hand side of (9.12.1) is insufi&cient^. To remedy this, we 
apply Theorem 9 not to but to c-^a^y c^a^y c^a^y 

and choose the c so that, when is near the boundary of 
convergence, these numbers shall be ‘roughly equal* . This requires 
that shall be roughly of the order of n. 

These considerations suggest the following proof. We have 


gS (c^Ca ... S 

n TtmSn 


— S 

m 




• Carleman (1). The proof given hero is due to Polya (2), The less precise 
convergence theorem (without the constant e) was found independently by other 
writcuFS, and there are a number of proofs of one form or the other of the theorem. 
See Oollingwood (in Valiron, 1, 186, where there is a proof due to Littlewood), 
Kaluza and SKeg5 (1), Knopp (1), Ostrowski (2, 201-204). 


b 


S a„=: 


but the right-hand side is generally divergent. The proof fails because the a in 
Oj ... Uyi are ‘too unequal*, and too much is lost in replacing ®(u) by 
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In Older that the inner sonunation should be easily efieoted, 

Treohoose (cic,...cjv*=«+l, 

when 

(m+l)™ 


Cm = - 


nt 








na«»(«+l) *»’ 


and then 




• Hidt 


•('•a' 


<eZa, 




by Theorem 140, unless a„ is null. 

We can prove the constant best possible as in § 9.6. We may, 
for example, take a„ = 1/n for n ^ /m, a„ = 0 forn > ju., and make n 
tend to infinity. 

The corresponding integral theorem is 
336.* Iffis not nvU, then 

J expj^j* log f{t)dt^dx<ej f{x)dx. 


9.13. Theorems with 0 < p < 1 . We have supposed so far that 
the parameter p involved in our theorems is greater than 1. 
A good many of them, however, have analogues with a p less 
than 1, and we give a selection of them in this section. The 
characteristio difiierence between the two cases lies (as is to be 
expected after our experience with Holder’s and Minkowski’s 
inequalities) in a reversal of the sign of inequality. 

(1) 336. If K (x, y) is nan-negaJtive and homogeneous of degree 
— 1, 0 < p < 1, and 

I* K(x,l)x-VPdx^ r K(l,y)y-^P' dy—k <cOt 

Jo Jo 

then 

^00 ^flo / rtt> \ifp / ^00 \Vp* 

(o) J 0 J ^ («, y)/(*) g(y)dxdy'S.kyj Jp dx^ y ^ tff’dy^ , 

(b) d/y ^ K (*, y)f(x) i® j** ^h^j^pix) dx. 

Here, in aooordanoe with the conventions of §§6.1 and 6.6, 

* Knopp (1). 
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(a) means ‘if the double integral, and the second integral on the 
right, are finite, then the first integral on the right isfinite^and ; 
and (6) means ‘if the integral on the left is finite, then the 
integral on the right is finite, and ... 

If we use the second method of § 9.3, the proof of (a) is the same 
as that of (a) of Theorem 319. The sign is reversed because we use 
Holder’s inequality with p < 1. To deduce (6) from (a), we appeal 
to Theorem 234. We leave it to the reader to frame the corre- 
sponding theorem for p < 0, and to consider the question of equality. 

We cannot take X = l/(a; + y), since then fc = oo. There is there- 
fore no exact analogue of Hilbert’s theorem. 


/; 


(2) 337. 7/0<i5<l,/(a;)^0, 

O 

pdx«x>, 

\ 

and 7’(a:) = J S{t)dt, 

unless 0. The constant is the best possible. 

We may deduce Theorem 337, in an imperfect form, from 
Theorem 336, by taking 

K{x,y)^0 {x<y), K{x,y) = ^ 


when 



x~VPdx= 

l-p 


To prove the complete theorem in this way would involve a 
discussion of the sign of inequality in Theorem 336 (and so in 
Theorem 234). We therefore follow a direct method analogous to 


that of §9.8. 

We may suppose 



dx 


finite, sinoe otherwise there would be nothing to prove. 
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We have 


Since F decreases as z increases, 


tends to 0 both when *->0 and when aj->-oo. Hence (O.is.l) 
gives in the limit \ 



We leave the completion of the proof to the reewier. 

For a more complete result, corresponding to Theorem 330, see 
Theorem 347. 


(3) Finally we prove a theorem which is related to Theorem 326 roughly 
as Theorem 337 is related to Theorem 327. The correspondence is not quite 
precise, and the theorem illustrates very instructively the slight additional 
complications which are sometimes inherent in a theorem concerning series. 


838.» If 0 <p < 1 and < oo, then 




urdess (a„) ia null. The dash over the summation on the left-hand side implies 
that the term for which n^listohe multiplied by 


The constant is the best possible. 
In Theorem 337, take 


1 + 


1 

1-p- 


f(x)=:0 (0<x<l), f(x) = an (0<n^a;<n+l). 
Then, if 0<n^x<n-h 1, 

F_ (n-hl-a;)an4-gn^-i + -» ^ qn + gn+iH-..> 

X X ^ n * 


Hence 

while 



Jo 1— p 


and the result follows from Theorem 337. 


* The Bubstanoe of this theoiem woa oommunioated to ns by Prof. Elliott in 1927. 
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Some Buoh gloss as that contained in the last clause of the theoremis 
necossary ; the result is not necessarily true if the dash is omitted*. 


9.14. A theorem with two parameters p and g. We con- 
clude this chapter with a theorem which, although again an ex- 
tension of Hilbert’s theorem, has peculiaritieB which do not occur 
in any of the earlier theorems of the chapter. It involves two 
independent indices p and g and an undetermined constant 
K(p,g). 

339. 7/ P>1. g>l, = 

so that 0<A=2-i — + 1, 

p q p q'- ’ 


then 


22 


1 1 


m n ^ rr 

(TO+n)^“ 



where K=K(p,q) depends on p and q only. 


This theorem reducestoTheorem315wheng=p', A= 1: in that 
case we know the best possible value of K. The best value has not 
been found in the general case, and the problem of determining it 
appears to be difficult. We shall prove later (§10.17) a deeper 
theorem in which A< 1 and m-(-» is replaced by jm— n] (equal 
values being then excluded from the summation). 

It is sufficient to prove that, if Ya^=A, then 


and for this, by Theorem 13, that 
(9.14.2) 

m 

6. 


where 

Now 
andj>'^g. Hence 


n^m 


2i3„»' g 2 


* Take Oj = 1, a,*ag= ... =0. Then the leeult is false if p > For an altematire 
form of the result see Theorem 345. 



264 


HILBBBT’S IKBQtXALITT AND 


But 

and 

Hence 




/S, \^® 

(H 




p’-q 


+ 2— j)'A=0. 




by Theorem 326. This proves (9.14.2). 
Similarly we prove 


840. Under ihe same conditions as those of Theorem 339 


MISCELLANEOUS THEOBEMS AND EXAMPLES 


841. If (i) a„is g(y) are non-negative, (ii) the summationa go 

from 1 to 00 cuid the integrations from 0 to oo, 

(iii) (So«»)i/*=4, = {Spdxflf = F, ijg‘^dy)^’''=a. 


and (iv) jp > 1, then 

( 1 ) 


LS 


Max (m,n) 


<pp'ABy 


(2) 

unless (a^) or (6„) or f (a;) or g (y) is null. The constants are the best possible. 

[Cases of Theorems 318 and 319, (a). In order to shorten the statements 
of the following theorems we agree that conditions (i), (ii), and (iii) are 
pre-supposed in all of them; and that, whenever the conclusion is ex- 
pressed by an inequality 

X<KY (or X>KY), 

with a definite X, then K has its best possible value (unless the contrary is 
stated explicitly) and equality is excluded unless a sequence or function 
involved in the theorem is null. 

When, on the other hand, the conclusion is 

X^KYg 

with an unepeciJUd X, then X is a function of any parameters of the 
theorem.] 
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842. If j>> 1, tiiea 

(1) o«6i,<ir*coseo*^.4B, 

ir 

(^) ^ ^^^^f{^)9(y)dxdy<7!^ooBeo*^,FO. 

[Also cases of Theorems 318 and 319, (a). Here 

f rr’^iC”^^®d»=7r®co8eo*-.] 

Jo 1 p 


848. If j7 > 1, then 

8 2 mnlogmn sin(7r/p)V2 ^ / t\^) * 

[MulhoUand (2). Since 


, m+1 1 

® m m 

the result is slightly stronger than that obtained by taking A^i^logm, 
Mn = logn in Theorem 321.] 


844. If 0<p < 1 then 

S (a„ + On+i + • • • )' > S (na„)». 

[Copson (2). This theorem, with Theorems 326, 331, and 338, forms 
a systematic set of four.] 

845. If 0 <p < 1 then 

[Corollary of Theorem 344. Compare Theorem 338; here there is no 
gloss, but the constant is less favourable and is presumably not the best 
possible.] 

846. If (a) o>l, ««=ai + Oa + . .. + »„, or (6) c<l, «n=<*n + ®n+i + 
then 

(a) Sn-^«n*^^iCSn-«(naJ*» (p>l), 

(p) J:nr^8n^^KJ:n-^{na^)^ {0<p<l). 

[In each of the four cases il = H (p, c), as laid down under Theorem 341 • 
See Hardy and Littlewood (1). 

We prove (a) when c> 1. If 

^„=:n-« + (n+l)-'® + ..., 

then Hence, if we agree that ao=0, we have 

Sn-o »„«« = 2 - ^„+i ) ^ S { *,* - W) 

11 1 

and («) foUowB. Hardy and littlewood (2) give ftmction-theoretio 
appUcations of («) and O). The important case is that m which o= Z.] 
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847 . If r and jP* satisfy the conditions of Theorem 330, but 0 < < 1 , then 

[Hardy (5).] 

848. If <r{y)=Za„e-”l» 

asidp>l, then 

[Take K (x, y) and apply Theorem 319, (6). More genial but 

less precise results are given by Hardy and Littlewood (1), and some 
function-theoretic applications by Hairdy and Littlewood (2).] 

849. If An and A* satisfy the conditions of Theorem 332, then 

EAn ... ® ^lAn^n • 

[See Hardy (4).] 

860. Ifp>l, ^(a?)>0, and 

IK{X) X*~^dx=:^<l>(8), 

then 

j lK(xy)/(w)g(y)dxdy<./,(^'j (^jx>^*pdxj^(^jg>'dyy 
ldx^jK(xy}f(y)dyy jx^-*pdx. 

jx»-*dx(^lK{xy)f(y)dyy jpdx. 

In particular, when K (a;) = e“*, and .F(x) = J X (xy)f (y) dy is the 'Laplace 
transform’ off(x), 

jF»dx<r’’(i^ jz^tfpdx, lxP-*F’'dx<T^(^^ jpdx. 

861. If also K (x) is a decreasing function of x, and 

A(x) = 'SanK(nx), A„=ia{x)K(nx)dx, 

then 

IlAp<p(^ fx>^*a»(x)dx, 

jx<^*AHx)dx<pQ^ So.', 

ja>lx)dx. 
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852. If F(x) is the Laplace transform of f(x), and 1 <p ^ 2, then 

[For the last three theorems see Hardy (10). Theorem 350 may be 
deduced from Theorem 319 by transformation. It is not asserted that 
the oonstant in Theorem 352 is the best possible.] 


853. If 

K.i(x,y)- J Ko(xt)Ko(yt)clt, K^(x,y)=:J Ki(x,l)Ki(y,t)d£, 


f 




then SSiiCa (m, n) a«a„ ^ k SEJ^i (m, n) a^a „ . 

[See Hardy (9). The theorem is one concerning quadratic, not 
bilinear, forms.] 

354. 


SS 


log^) 22 


m + n 


355. 


= Maxim, n)’ 


[CJorollaries of Theorem 353. Observe that Theorem 354, when com- 
bined with Theorem 315, gives 


aog(m/n) ^ 
m— n 




in agreement with Theorem 342.] 

866. If o(sc)=J^a(t)b(x~t)dt, 

A»=far^ / xri(xn{x) j" dx, 0'= j ar‘(»ve(»)) dx. 

P>1. 9>1. y=«.+ P-l. 

then CkKAB. 

^_r(i-a)r(W) 

where ^ r“( T- y) 


867. If Oo=6o®0. c„=ao6„+Oi6„_i + ...+a„6«, 

« « .. .nH « TOtiafv the conditions of Theorem 356, and 
inequality is true with some K, 


HI 
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868* If ®o™ 5 q ™ ••• ® O 0” 0 

and «,=Sa»,6,,...<Vi (O^O, rr#=n), 

then Su„*<i |r ... 


859. Jfp> l,l>0, m> 0 and c(x) is dejQned as in Theorem 356jthen 

T{l)T(m)\ 


where 


{r{Z + m) 


r 


There is equality if and only if 

a(aj)=-4aj*~^e“‘^®, 6(a;)s 

where A^B^O are non-negative constants and C is positive. 

[For Theorems 356-359 see Hardy and Littlewood (3» 5, and 12).] 


360. If L(x) is the Laplace transform of f(x), and q > 1, then 


861. Ifp>l,g>L 

u=i + --lg0, 
p q 

and L, M are the Laplace transforms of/, gr, then 
I x-t^LMdx^KFQ. 


862* Ifp>l, 0gfi<l/p, and 

*» = ^{W +»»)>-»** 

then . 

[This may be deduced from Theorem 339 by the converse of Holder’s 
inequality. Many further theorems of the same general character as 
Theorems 360-362 are given by Hardy and Littlewood (1).] 

868. If A« is positive and 

p>l, il,i=ai+ag+. ..+»«, Ai+A|+ ... + 

then 

864. If Aa is positive and 

p>l, r>l, Ai’'+V+>”+VSc», 

then 

The result is not necessarily true when rs 1 (as it is when A^^ 1). 

[For these two theorems, which are corresponding extensions of 
Theorems 326 and 315, see Hardy and Littlewood (11).] 
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365. The inegiialities in Theorems 326 and 334 are the special oases 
^=sa?*(0<^< 1), ^=:loga?, of 

(i) <K{^) Sa.. 

[Knopp (2). This remark has led Knopp to a systematic investigation 
of forms of <f> for which (i) is true. See also Mulholland (4).] 

866. Suppose that ^ and ^ are continuous and strictly increasing for 
aj>0, and have the limits 0 or ~oo when a;->0; and that </> is convex 
with respect to 0 {§ 3.9). Then (i), if true for is also true for with 
KW^K(4>). 

[Knopp (2).] 

3 , 7 . - 

[Knopp (2).] 


17-2 



CHAPTER X 
REARRANGEMENTS 


10.1. Rearrangements of finite sets of variables. Iil what 
follows we are concerned with finite sets of non-negative nui^bers 


such as 


1 ^2 1 * * * 1 1 


^ 2 > 


®--n > • * • J 5 •••» •••> * 

we denote such sets by (a), (6), .... 

Taking for example the first set, in which assumes the values 
1, 2, n, we define a permtUaticm function as a function 
which takes each of the values 1, 2, n just once when J varies 
through the same aggregate of values. If 

(i=i,2,...,?2,) 

then we describe (a') as a rearrangement of (a). Similar definitions 
apply to other cases in which the range of variation of j is 
different. 

There are certain si)ecial rearrangements of (a) which are 
particularly important here. These rearrangements, which we 
denote by ( 5 )^ ^+a), (a*), 

are defined as follows. 

The set (d) is the set (a) rearranged in ascending order, so that, 
when the values of j are 1, 2, 

dj ^ d/j^ ^ ^ d„ . 

The set (d) is defined unambiguously by the set (a) although, 
when the a are not all difierent, there are ambiguities in the 
definition of the permutation function by which we pass from 
(o) to (d). 

In defining the sets {a+), (+o), (a*) we suppose that j varies 
from — » to TO. The set (a+) is defined by 

Oo+ S ^ a_i+ S aj+ ^ a_s+ ^ . . . 

and the set (+a) by 

+Oo > +o_i S +ai ^ +a_ J ^ ^ . . . . 
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There is one particularly important case, that in which every 
value of an a, except the largest, occurs an even number of times, 
while the largest value occurs an odd number of times. In this 
case we shall say that the set (a) is symmetrical. The sets (a+) and 
(+a) are then identical, and we write 

a+ = +a = a’*‘, 

so that a* is defined by 

A set (a*) may be said to be symmetrically decreasing. The sets 
(a+) and (+a) are sets arranged so as to be as nearly symmetrically 
decreasing as possible, but with the inevitable overweight of one 
side arranged systematically to the advantage of the right or the 
left respectively. All these sets are defined unambiguously by (a), 
though there may be ambiguities in the definitions of the 
corresponding permutation functions. 

We note that 

( 10 . 1 . 1 ) 

10.2. A theorem concerning the rearrangements of two 
sets. We begin by proving a very simple, but important, theorem 
concerning the set (d). 

SBS.® If (a) and (6) are given except in arrangement^ then 

Safe 

is greatest when (a) and (fe) are monotonic in the same sense and 
least when they are monotonic in opposite senses \ that is to say 

(10.2.1) S 

1 1 

It will be observed that, since we can add up the sum Safe in 
any order, we may suppose one set, say (a), arranged from the 
beginning in any order we please (in particular in ascending 
order). 

We may express the theorem equally well by saying that the 
Tnft.yi TnTiTn corresponds to ‘similar ordering* of (a) and (fe) in the 

» This theorem and Theorem 369 are valid for all real, not necessarily positive. 
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sense of § 2. 17, the minimuni to ‘opposite ordering’*. The theorem 
becomes ‘intuitive’ if we interpret the a as distances along a 
rod to hooks and the b as weights suspended from the hooks. To 
get the maximum statical moment with respect to an end of 
the rod, we hang the heaviest weights on the hooks farthesjb from 
that end. 

To prove the theorem, suppose that the (a) are in asoehding 
order, but not the (b). Then there are a,J and a k such that 
and bf>bif. Since \ 

0^6* +a^.bf- (a^bj + a*.6*) = (o*. - a^) (6,. - 6*) S 0, 

we do not diminish Soft by exchanging b^ and 6*. . A finite number 
of such exchanges leads to an ascending order of the b, so that 

Sa6 £ Soft. 

The other half of the theorem is proved in the same way. 

This argument establishes incidentally a variant of Theorem 
368 which is sometimes useful. 

369. If 

( 10 . 2 . 2 ) 'Lab'^'Lab 

for all rearrangemenU {b') of (6), tAew (a) and (6) are similarly 
ordered. 

For, if (a^— Ofe) (b^—b^) < 0 for any j, k, we can falsify (10.2.2) 
by exchanging b^ and b^. 

10.3. A second proof of Theorem 368. We have to consider 
analogues of Theorem 368 for more than two sets of variables. 
These lie a good deal deeper and cannot be proved in so simple a 
manner. We therefore give a second proof of Theorem 368 which, 
though quite unnecessarily complicated for its immediate object, 
will serve to introduce the method which we use later. We con- 
fine ourselves to the second inequality (10.2.1), and divide the 
proof into three stages. 

(1) Suppose first that the sets considered consist entirely of 

* Theorem 43 (with r s 1 and 27= 1) may be expressed, in our present notation, 
in the fom Eo, 
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0 8 and 1 s; we indicate such special eete by the use of Grennan 
letters a, b, ... . Then 

(10.3.1) 0^=0, 6^=6 

for all j'. In this case 

Sab^Sa, SabgSb, 


of sets 


and so Sab < Min (So, Sb) = Sa5. 

(2) Any set (a) may be decomposed into a linear combination 
(ai), (a*), ..., {oO“ 
of the special t 3 ^ considered imder (1), in such a way that 

(10.3.2) a^=aio/ + a2o,*+... + a'o/ (j= 1,2, ...,»), 
and 


(10.3.3) o^=aia/ + a*aj®+... + a‘a^' (j= 1,2, . ..,«), 

the coefficients a being non-negative. 

The method of decomposition will become clear by con- 
sidering a special case. Suppose that (a) contains (in some order) 
the three numbers A, B, O, where 0 ^ A ^ 5 g C, so that 

Oj — A , ^2 " Bf ttg — C*. 

Then 

Oi=A.H-(B-A)0+(C'-B)0, 

02 = A . 1 -t- (5- A) H- (C - 5) 0, 

08= A . 1-)- (5- A) 1 -I- (C7- .B) 1; 
and we may write 

dj = a} 5/ + Oy® -t- 5y®, 

where a^=A, a®=5— A, a?=C—B 

and (5^), (o®), (5*) are the three sets 

( 1 , 1 , 1 ), ( 0 , 1 , 1 ), ( 0 , 0 , 1 ). 

If then we perform the permutation which changes (o) into (o) 
and at the same time (5^), ... into (a^), we obtain 

Oy = (A Oy^ + «®0y®-|- a®0y®. 

» a< a'*' and moans «'« : in (10.3.1) above, o* i* a power, but this nse does 

not recur. 

is defined by this permutation. 
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In the general case we proceed in the same way, writing 

Oi=ai.l + (aj-Oi).0+(o8-O2).0+..., 

• 1 "i" (®2 • 1 "i" ip^ ~~ ^ 2 ) • ^ "t" • • • > 


This secures (10.3.3), and (10.3.2) then follows by rearrangement, 
as in the special case. \ 

(3) From (1) and (2) we can deduce the general theoreW For, 
decomposing (6) as in (2), we have \ 

af='LaJ‘af, 5y=SaPa/, 

p p <r 9 


j p <r j 


10.4. Restatement of Theorem 368. It will also be useful 
to restate Theorem 368 in different language. We suppose now 
that, in the sets (o), (6), j runs from — » to ». We write 
f{x) = 'ZajXi, g{x) = ZbjX^, 
and call ao=(S,{f (x)) 

tile central coefficient of f. Plainly 

e(/(*-^)) = ®(/{*)). 

Also S o,6,=Sav6 .=©(/<;). 

r +«=0 

The sets (a^+) and (+6_y) are similarly ordered, and if we write 
/+(a:) = So/a:^ +/(*) = 2+ a^a;^, 
so that, by (10.1.1), 

/+(»-!) = +/(*). 

then Theorem 368 gives 

Hifg)— S o,6,=Saj6_^<So^++6_^= S o,++6„=E(/++p). 

r +««»0 r +«—0 

Hence we deduce 


870. The central coefficient of 

S a^x^ S bjS^ 

— n — n 
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is greatest, for aU rearrangements of the a and b, when (af) and (b_f) 
are similarly ordered, in particular when (a) is {a+) and {b) is (+6) 
or when {a) is (+o) and (b) is (6+). 

10.6. Theorems concerning the rearrangements of three 
sets. We pass now to theorems involving three sets of variables. 

371.®' Suppose that the c, x, and y are non-negative, and the c 
symmetrically decreasing, so that 


Co^Ci = C_i^C2 = C_2^...^C2fc = C_jyi,, 

while the x and y are given except in arrangement. Then the bilinear 

form k k 

/S= S S c,_,x,y, 

attains its maximum when (x) is (x+) and y is (y+). 

It is evident that, if this is so, then the maximum must also be 
attained when {*) is (+a;) and {y) is (+y). 

372.'> Suppose that (a), (6), (c) are three sets satisfying 
(10.6.1) aoS«r = «-r. boib^=b_g, CoSc,=c_,. 

Then the maximum of 

S afb„Cf=tl (Sa,.af 'Lb^yf 'LCfd), 

r +»+<“0 

for rearrangements of the sets which leave a^, bg, Cg unaltered, is 
attained when (a), (6), (c) are (a*), (b*), (c*). 


373.e If (a), (6), (c) are three sets, of which (c) is symmetrical in 
the sense of § 10.1, then 


S S a++b^ci* 


= S +a,6,+c,*. 


It will b© sufficient to prove Theorem 373, since this includes 
the other two theorems. In the first place, Theorem 373 is Theorem 
372 freed from the restrictions (10.5.1), wholly in regard to (a) 
and (6) and partly in regard to (c). To deduce Theorem 371 from 
Theorem 373, we put 2k=-n, x^=a^,y„=b„ and suppose that 
the a and b outside the range { — k,k) are zero. We may observe 


» Hardy, Littlewood, and Polya (1). 

Hardy and Littlewood (4), Gabriel (1). 
« Gabriel (3). 
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finally that Theorem 370 is the aimple ease of Theorem 373 in 
which Cga 1 and the remaining e are 0. 

10.6. Reduction of Theorem 373 to a special case. We 
dividetheproof of Theorem 373 into three stages, as in flOp.The 
whole difficulty of the proof lies in stage (1), in which (o), m6), (c) 
are of types (a), (h), (c) : and we take this stage for granted for the 
moment and dispose of the easier stages (2) and (3). \ 

First, we may decompose (a), (6), (c) into sums of sets^ (a'*), 
(b"), (c’’), in such a way that ^ 

o^=Sa^o/, 6y=S/S‘'h^", c,=Sy’-c/, 

par 

and o^+=SotPa/''', 

Here the o, b, c are all 0 or 1, the a, /S, y are non-negative, and 
(a pointwhichdoesnotarisein § 10.3) the sets (c’’) are symmetrical. 
AUthis is proved by the method of §10.3 (2).“Whenwehavedone 
this, and proved the theorem for sets of type (o), (b), (c), we have 

S o,6,c,= S «/>j3"y^ S a/t>/c/ 

r +«+<"“0 r 4-«+<*0 

g S otf>iSv S o/++b/c,^*= s o,++6,c,*, 

p,<r,r r+fii+t"'® 

and the proof is completed. 

It remains to prove the theorem in the special case in which all 
o, 6, c are 0 or l.*> The set c, being symmetrical, contains an even 
number of O’s and an odd number of I’s. We write 
/(*)== S(^af, g(x)-U}^3f, fe(a;) = Sc,ai>. 

Since we may add any number of O’s to the sets, we may sup- 
pose that all the summations run from — n to ». 

Wo have also 

/+ (aj) = ]So,+af = ar* -f ...•+• 1 -1- ... -I- , 

+p(x) = S+6,**=ar-^-l- ... + 1 + ... -f-x®, 

h*(x)=Sc,*X' =ar-*’+... + l-i-...-f-®^, 

• In order that the lets (cT obtained by the prooeas of $ 10.3(2) ahonld bo 
eymmetrical, in the sense of § 10.1, we drop those which correspond to aero y. 

a So that, strictly, we should write a, b, C for a, 6, c. There is, hower-or, no further 
neoeaaity for this notation. 
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where B, B\ 8, 8', T are non-negative integers and 

(10.6.1) R^R'^R+l, 8^8'^8-\-l. 

We have to prove that 

(10.6.2) 6(M)ge(/++9A*). 

The inequality (10.6.2) may be made ‘intuitive’ by a geo- 
metrical representation. Let x, y be rectangular coordinates in a 
plane, and represent each non-zero coefficient of /, g, A by a line, 
»=r for o,= 1, y=« for 6,= 1, and x+y— —t for 0 ^= 1. If a^6,C( 
contributes a unit to ^(fgh), these three lines intersect. Each of 
the functions f,g,his represented by a family of parallel lines. 



Pig. 1. Graph of/, g, ft. 2- of/+. 


and difgh) is the total number of triple intersections of these 
lines. We represent /+, +g, h* similarly; /+ is also represented by 
R+l + R’ vertical lines, but now these lines are shifted as near as 
possible together. Typical figures are shown in Kgs. 1 and 2: 
here (a), (6), (c) are the sets 

1 , 0 , 1 , 0 , 0 , 1 , 0 , 0 , 1 ; 

1 , 1 , 0 , 0 , 1 , 1 , 1 , 0 , 0 , 1 , 

1 , 0 , 1 , 0 , 0 , 0 , 1 , 0 , 1 , 0 , 0 , 1 , 0 

respectively; and 

i?=l. i2' = 2, 8=2, S' = 3, T=2. 

It is intuitive that the number of intersections isj^atest when, 
as in Fig. 2, the diagram is as condensed as possible. 
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Our proof of (10.6.2) may be presented geometrically^ and 
followed on the figures. We reduce the actual case considered to 
a simpler one by taking away one horizontal and one vertical line 
from each figure, as is suggested by the thick Unes in the figures. 
We prefer however to state the proof in a purely analytical]form. 

10.7. Completion of the proof. There are three subor^nate 
oases in which the proof is easy. \ 

(1) If jB' = 0, /+ reduces to 1 and the result is included in 
Theorem 370. 

(2) If S' = 0, +gr reduces to 1 and again the result is included in 
Theorem 370. 

(3) Suppose that 

(10.7.1) R+8'^T, B' + 8<T. 

We have in any case 

(10.7.2) 

S (/grA) = S a,6,c< g Sa,. 26, = (i? + 1 + i?') (S + 1 + S'). 

But, when the inequalities (10.7.1) are satisfied, 

e (/+ +<7 A*) = (£ {(ar-«+'Si +... + «*'+«) (a:-r +... + 1 + ...+ a:r)} 
is the sum of all the coefficients of f++g, and therefore 

(10.7.3) 

® (/++flfA*)= 2o,6,= 2a,26,= (i?+ 1 + S') (S+ 1 + S'). 

r,» r $ 

The result follows from (10.7.2) and (10.7.3). 

We now consider the general case in which 

S'>0, S'>0, Max(S+S', S'+S) = «> r. 

We assume that the result has been proved for 
Max(S+S', S' + S)<n 
and argue inductively. 

Let xP be the highest power in/, af' the lowest in g, and write 
f-!eP=^, +g-ar^ = i!^. 

Since S' > 0, S' > 0, none of these functions vanishes identically. 

* See Qabriel (3). 
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Then S9h=^> + xP){xf’+^)h^^h + xK 

where 

Since the highest power in is lower than icP+" and the lowest 

power in is higher than there is no overlapping, and all 
coefficients in x are 0 or 1. Since the sum of the coefficients in h 
is 2T+ 1, it follows that 

e(xli)S2T+l, 

(10.7.4) e {fyh) ^ e{#A) + 2r+ 1. 

On the other hand 

(10.7.6) /+ +gh* = (<? + x^') (*-«' + ^)h* = fijih* + xh*, 

where _ 

X = x~^^ + a:®'”®' + x^'tfi 

= x~^~^' + . . . + + a;®'"®' + a;®'~®'+i + . . . + a:®’+® . 

The sequence of exponents in x is an unbroken one, extending 
from — B— 8' to R' + S. We know that either B+ 8' or S' + 8 
is greater than T. It R+ 8' > T then, by (10.6.1), 
R'+8iB+8'-l^T, 

and so the unbroken sequence from —TtoT, of length 2T+ 1, 
is part of the sequence of exponents of x. The same conclusion 
follows when B' + 8>T. Since h* has an unbroken sequence of 
exponents of length 2T+ 1, centred round the constant term, 
it follows that g = 2T + 1, 

and so, by (10.7.6), that 

(10.7.6) ® {f++gh*) = e + 2T+1. 

Now ^+(a:) = a:~®'~W+... + »® = ^(a:~^),* 

+0 (a:) = ar® + . . . + a:®'-^ = <? (ar-i). 

Also 

Max(iJ'-l + -S, i 2 +-S'-l) = Max(i?' + /S, iZ+fl^')-l=«-l. 
and so, by our hypothesis, 

(10.7.7) ® (#A) ^ d {<l>++>l>h*)=^ {f (a:-i) ^(ar-i)fe* (a:)} 

= © {^ (a:) ^ {x) k* (a;-^)} = © 

» It is not necessarily true that 

^+{x)=4{x) =iar-« + ... +xP^-\ 

since, if this polynomial is ‘overweighted’ at the wrong end. When B'=ie + 1, 

either formula is correct. 
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Next S4 — a;^ S 4- c^j Pp^j) 

J 

is not aSeoted because c_j=ej. Similarly 8^ and 

^9 ~ ^ yj»-j + ^p-i Vp+j 

+ Cj+j Xp^i yp_j + Cj_j Xp^.Mp+j ) 

are not affected. It remains to consider 8, and 8^. 1 

The contribution of the pair Xp_f , Xp^^ to 8, is \ 

Xp^ S {c_i+jyp^ + c^i^yp^j) + Xp^t S + Ci^jyLj), 

J J ^ 

and the increment produced by fip is \ 

~ ~ ®j>+<) 2 (®<-j ~ ®<+j) {Vp-j ~ Vp+j)' 

The total change in 8,, is the sum of this increment over and 
is positive, provided that there are J and i I, since the three 
differences written are respectively positive, positive, and 
negative. Hence 8^ is increased if there are J and i#:/; and 
similarly 8^ is increased if there are I and j 4= J- Finally, 8 is 
increased if either of these conditions is satisfied. 

If j) 4: 0, there are i 4: 7 and j 4° J ; and then 8 is increased unless 
A is right with respect to p. In any case, whatever p, 8 is not 
diminished. 

Suppose now that A is not a C. Then A is wrong with respect 
either to some p 4=0 or to p=0 and the pairing (10.8.3). The 
argument above, or the similar argument based upon the pairing 
(10.8.3), then shows that 8 is increased by Qp (or the corre- 
sponding substitution based upon the other pairing), and that 
A is not a K. Hence the K are included among the 0\ But if a O' 
is not O, then the substitution Qg replaces it by O and does not 
diminish 8] and therefore 0 is a jK”.* 

It does not seem to be possible to prove Theorem 373 by any 
equally simple argument based upon a substitution defined 
directly. 

10.9. Rearrangements of any number of sets. There are 

^ The argument is the same in principle as that used by Hardy, Littlewood, 
and P61ya (1), and substantially reproduced by Hardy and Littlewood (6). We 
have however expanded it consid^ably. Dr R. Rado having pointed out to us that 
the original form of the argument was not conclusive. Another form of the proof is 
indloated in Theorem 389 at the end of the chapter. 
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analogues of Theorem 373 for more than three sets (a), . . which 
may be deduced from Theorem 373 itself. 


374.®' If (a), (6), (c), (d), . . . are finite sets of non-negative numbers, 
and (c), (d), ... are symmetrical, then 

(10.9.1) S a^b^Cid^...-^ S o^++6gC,*d/.... 




r-f s4-< fw-f ...=0 


We assume the theorem to be true when there are Jfc~l sym- 
metrical sets (c), (d), ... involved, and prove that it is true when 
there are fc. We shall make use of the following theorem, which is 
of some interest in itself. 


376. If (c*), (d*)j ... are symmetrically decreasing sets, then the 
set (Q) defined by 

(10.9.2) 2 

.=n 

is symmetrically decreasing. 

It is enough to prove the theorem for two sets (c*), (d*), since 
its truth in general then follows by repetition of the argument. 
We may agree that, when there is no indication to the contrary, 
sums involving several sufl&xes are extended over values of the 
suffixes whose sum vanishes. 

It is plain that C-n — Qn • Further, for any set {x), we have 
Qm^^^rn Qn ~ Qm* 

m 

by Theorem 373. It follows, by Theorem 369, that the 
similarly ordered to the , and therefore, since is an even 
function of m, that the set is symmetrically decreasing. 

This is the most elegant proof, but there is a simpler one which does not 
depend upon Theorem 373. 

We drop the asterisks for convenience and suppose n ^ 0. Then 
Qn = ^<^n+r^-jr + ^n+l-r^r-1 » 
the summafions extending over 1. Similarly 

Qn+l = dr • 

Subtracting, and using the equations d_,. = and di_r = d^^i , we obtain 
Qn - On+1 = S {c„+,.(d^-di_,.) + (dr-l ^ dr)} 

= S “• ^n+r) (df— 1 dr). 

Since |n+l-r|<n + rforn^0,r^l. each term here is non-negative. 

• Gabriel (3). The case of the theorem in which all the sets are aymmetrioal was 
proved by Hardy and Littlewood (4). 

i8 
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Betutning to the proof of Thecnrem 374, we define as in 
(10.9.2), and P„ by 

Pfn— S 

r+5—w 

Then . . . = . . . g SP+ c,* d*..., 

by the case i — 1 of the theorem^. That is to say, 

m m 

where ^(m) is a permutation fdnotion for which Pm= P^<ffl)\ 
Sa».6,c, d„e, . . . S So,6, ^ 2o,++6, 

=So,++6,Q„=S(v++fc,c,*d„*e,*..., 

which is (10.9.1). 

From Theorem 374 we can deduce >> 


876. Given any finite number of seta (a), (6) we have 


= Sa;rai, btbt.c^ct, .... 

fj ft «i 6g fj r, 


10.10. A farther theorem on the rearrangement of any 
number of sets. In Theorems 373 and 374 two of the sets, (a) 
and (6), were arbitrary, but the remainder were subject to the 
condition of ‘symmetry’. This restriction is essential; if (a), (6) 
and (c) are unrestiicted, it is not possible to specify the maximal 
arrangement generally by means of the symbols o+, +o « 

There is however a less precise theorem which is often equally 
effective in applications. 

877. For any system of k sets (a), (6), (c), . . . 

S afb,c,...^K{k) 2 afbfcf..., 

where K=‘K{fc)i8a number dependirvg only on k. 

We supi)ose £>=3; the argument is eBsentially the same in the 
general case. 

We define ihe sets (j8*), (y*) by 

(10.10.1) y*=c;J; (7»^0), 

( 10 . 10 . 2 ) y*^^vlirmoy, 

• With Pt e, d , ... for a, b, e , nnoe (c) ia lEfyiiimetrioal, 

b Gabriel (3)« ^ See Theorem 3S8 at the end of the chapter. 
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and (j8) and (y) as the sets into which (j3*) and (y*) are changed 
by the permutations which change (5+) into (6) and (c+) into (c) 
respectively. Then (^) and (y) are symmelaical seta. Further, 
since bt^ g 6+ and cl „ S c+ when m ^ 0, we have 

c+^v* 

for all n, and so 

(10.10.3) C„gy„ 
for all n. 

We shall also require an inequality for /S* and with m< 0. 
We have b* S 6i„+i and c+ g cl„+j for n ^ 1, and so, by (10.10.1) 
end (10.10.2), 

(10.10.4) (m<0). 

Using (10.10.3) and the symmetry of (jS) and (y), we find 

8=- S a^b^c,^ S OrPiViS S 

r-fa+<-«0 r+a+i— 0 r+«+<—0 

by Theorem 373. The last sum is 

( L + S + S + S )a^^*yf; 

aZO.tm «<0,(&0 s£;0,«0 «<0,(<0 

and so, by (10.10.1) and (10.10.4), 

( 10 . 10 . 6 ) 

Si L <6.+ C++ S a+6;+iC+ 

«;iiO,(£0 «<a,<£0 

+ S <6,+ c+i+ S <6+.iC+, = Si+Sj+/Sf3+S4. 

8^0, t<0 8<0,t<0 

In 8^, «<0 andr+s + <=0, so that either r>0 or «>0. In the 
first case a+ i o;Li and in the second c,+ i . Hence in any 
case, in 8^, 

(10.10.6) = + ®r"' 

Similarly, in 8^, 

(10. 10.7) o+ 6+ c,++i g o,+_i6+ c + 1 + 0 + 6+_i c+ ^ . 

Finally, in (S*, «<0, <<0 and r+«+«=0, so that r^2 and 
a+ga;L2,and 

(10.10.8) o+ ^ • 

Ji now we substitute into (10.10.6) the upper bounds for the 
typical terms given by (10.10.6), (10.10.7), and (10.10.8), and 
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observe that, in these upper bounds, the sum of the suffixes is 
always 0, we obtain 

S^il + 2+2+l) S <6;c+= 6S 

r+«+<*“0 r+a-hi^^O 

which proves the theorem. 

I 

10.11. Applicadoiis. These theorems have important applications to 
the theory of Fourier series. It is easy to deduce from Theorem 376* that if 

f{6) = £ , /+ (6) = I . 

-jR -R 

where «;.= | a,. |+, and is a positive integer, then 

and this relation between trigonometrical polynomials may be extended 
to functions represented by general Fourier series. Series of the type 

have particularly simple properties. They converge uniformly except at 
the origin and congruent points, where the function which they represent 
has in general an infinite peak; and the ratio 

I" |/+(e)l*‘(W:S(|r| + l)‘*-*oc,« 

lies between positive bounds depending only on We thus find, for 
example, that 

f\\m £ (1 r I + ] 

For fuller developments see Hardy and Littlewood (9), Paley (3). 

10.12. The rearrangement of a function. The theorems of 
§§10.1-10.10 have analogues for functions of a continuous 
variable. 

Suppose that is non-negative and integrable in (0, 1), so 
that it is measurable and finite almost everywhere. If M (y) is 
the measure of the set in which (y) is a decreasing 

function of y. The inverse ^ of Jf is defined by 

and ^ {x) is a decreasing function of x defined uniquely in (0, 1 ) ex- 
cept for at most an enumerable set of values of x, viz. those corre- 

* See Gabriel (3). A less precise inequality was given by Hardy and Little- 
wood (9). 
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sponding to intervals of constancy of M (y). We may complete 
the definition of f {x) by agreeing, for example, that 

^(*) = i{^(®-0) + ^(a5+0)} 
at a point of discontinuity®. 

We call ^{x) the rearrangement of <f>{x) in decreasing order. 
It is a decreasing function of x which has, in general, an infinite 
peak at the origin. 

The measure of the set in which ^(x)^y is M(y)fi It follows 
that the two (in general quite different) sets in which 
yi^<f>{x)<y^, yi^^(x)<y^ 

have the same measure, and that the same is true of the sets in 
<f>{x)>y, ^(x)>y. 

We may say that the functions <f> (a:) and ^ (x) are ‘ equi-measur- 
able’; they have equal integrals over (0, 1) and 


f F{f)dx= f F(<l>)dx 
Jo Jo 


for any measurable F for which the integrals exist. 

We may define ^ (a:) similarly for a (f> (x) defined in any interval 
of X, provided that, if the interval is infinite, M (y) is finite for 
every positive y. 

If (a:) g <l> (x) then plainly {x) g ^ {x). Suppose in particular 
that <f>i (x) is ^ (a:) in JS and zero in CF. Then 

(10.12.1) I 4>{x)dx=j<f>i(x)dx=j^ ^i{x)dxgj^ f{x)dx. 
We shall use this inequality in § 10.19. In particular 


j’’<f>{t)dtgjy[t)dt 


( 10 . 12 . 2 ) 

if ^ (x) is defined in (0, o) and 0 ^ a: ^ a. 

*> obvious on drawing a figure. It must be remembered that ^ (») 

may have intervals of constancy, corresponding to discontinuities of M (y). it is 

however easy to prove that Jif(y-0) = Jlf(y) for aU y, and so that the assertion m 

the text is true even for these exception^ y. In fact 




■M{y)=mS„ 


where <S„ is the set in which y-n-^g^<y. and *1^8 limit of mS„ is sero. 
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Another type of rearrangement of a function will be important 
in what follows. Suppose, for example, that <f> {x) is defined for all 
real, or almost all real x, and that M (y) is finite for all positive y. 
We may define an even function if>* {x) by agreeing that 

r{iM{y)}=y 

and that <f>*{ — x) = (!>’* (*) ; or, what is the same thing, that (*) 

is even and <l>*(x}^^{2x) 

for positive x. Then <j>* (x) decreases symmetrically on each l|ide 
of the origin, where it has generally an infinite cuspidal peak. We 
call ^*{x) the rearrangement of <f>(x) in symmetrical decreasing 
order. 

10.13. On the rearrangement of two functions. We begin 
by proving an integral inequahty corresponding to Theorem 368 . 

378. WhetJier a ia finite or infinite, 

na ra 

I (fnlidx^ ^tjfdx. 

Jo Jo 

We prove this by an argument similar to that of § 10.3. In the 
first place, the theorem is true for functions which assume only 
the values 0 and 1. For suppose that E and F are the sets in 
which ^=1 and ^=1 respectively, and S, F the analogous sets 
corresponding to ifi. Then the first integral is m{EF), the 
measin« of the set EF which is the product of E and F, and 
m(EF)^Mm.{mE,mF)=Min(mE,mF)=m{EF). 

Next, the theorem is true for functions which assume only a 
finite number of non-negative values. In fact, following the lines 
of § 10.3, we can represent such a function ^ in the form 

where the a are non-negative, the ^ are always 0 or 1, and 
^ - 1 - ^2 + • • • ■!■ • 

The inequality then follows from a linear combination of in- 
equalities already proved. 

Finally, we prove the theorem in the general case by approxi- 
mating to ^ and tf), by functions of the type just considered. We 
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do not give the last two stages of the proof in detail, sinoe the 

arguments willreour in theproofofthemoredifficultTheorem379. 

10.14. On the rearrangement of three functions. We come 
now to what is our main object in these sections, the integral 
theorem corresponding to Theorems 372 and 373. 

879.* If f{x), g (x), and h (x) are non-negaiive, andf* {x), g* (*), 
and h*(x) are the egui-measurable syminetricaUy decreasing func- 
tions, ffien 

(10.14.1) f = f I* f{x)g(y)h{-x-y)dxdy 

J —CO J —00 

g r r f*{x)g*{y)h*{-x-y)dxdy=I*. 

J — 00 J —00 

We may plainly suppose that none of f, g, hia nuU. We may 
also replace —x—yhy ±x±y without changing the significance 
of the inequality. 

We prove the inequality (1) for functions which are always 0 
or 1, (2) for functions which take only a finite number of values, 
and (3) for general functions. As with Theorem 373, the whole 
difSculty lies in stage (1). We take this stage for granted for the 
moment and begin by showing that, if the theorem is true in this 
special case, it is true generally. 

A function which takes only a finite number of non-negative 
values 0, Oi , 02 , . . ., a„ can be expressed in the form 
f{x)~IX.ifi{x)-\‘Cl2f2[x)'h ... + *»/n(®)» 
where the a are positive, the/^ take only the values 0 and 1, and 

For we may suppose 0 < < 02 < .. . < «». take 

0C2“a2”“aj, ..., 1 * 

and A“M/^Oi). *0 (/<Oi), 

/g=l (/^Oj). =0 

A moment’s oonmderation shows that we then have also 

f* {x) = ajA* (®) + “sA* (*) + ...+ «n/«* (*)• 

• F. Kiess (8). 
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If we suppose that each of f, g, h takes only a finite number of 
values, and decompose them in this way, then (10.14.1) follows 
from the combination of similar inequalities involving triads 

To pass from this case to the general case, we approxindate to 
fy 9y A by functions which take only a jfinite number of valu^. We 
can approximate to/, for example, by the function/^ definlpd by 

/«=^ 1.2, ij, 

and to g and h similarly. Then/,^g/, /^* g/*, and similarly for 
g and h. Hence (assuming that the theorem has been proved for 
the special type of functions) we have 

fn= f f 

J —00 J - C 

and so 7=lim S I*. 

It remains to prove the theorem in the special case when/, g, h 
assume only the values 0 and 1. It is however convenient first to 
make a further reduction of the problem. 

First, we may suppose that the sets F, G, and H in which 
/, g, and h assume the value 1 are finite. If ttm of these sets are 
infinite, then two of/*, g*, and A* are 1 for all a;, in which case 
/* = (»•» and there is nothing to prove. Suppose then that just 
one of the sets, say F, is infinite. Let Fg be the part of jP in 
(—N, N), let N be the smallest number for which mFg>2n, 
and define /„ as being f in F^ and 0 outside. Then (assuming 
the theorem to have been proved when the sets are finite) 

/«= f" f“ fn9hdxdy< r r f*g*h*dxdy 

J —ooj —00 J — ooj —00 

«= r r f*9*^*^dy% f* r f*g*k*dxdy^I*, 

J — nj —00 J — 00 J —00 

and so I =lim ^ I*. 

Suppose then that f(x) assumes the value 1 in a set i? of finite 
measure. We can represent E in the form € + e — e', where € is a 

* Compare the similar argument in §10.6. 
b Unless the third function is null. 
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finite set of non-overlapping intervals, and e and c' are sets of 
arbitrarily small measure^-; and the sets in which g and h assume 
the value 1 can be represented similarly. It is also plain that, 
since/, gr, and h do not exceed 1, small sets c, ... make a small 
difference in the integrals I and I*. We may therefore suppose 
that the sets in which /= 1, gr= 1, and 1 are finite sets of 
intervals; if the theorem has been proved in this case, its truth 
in the more general case follows by approximation. 

Next we may suppose, on similar grounds, that the ends of all 
the intervals are rational ] and then, by a change of variable, that 
they are integral. The theorem is thus reduced to dependence 
upon the case in which each of the sets in which /== 1, 1, or 

A = 1 consists of a finite number of intervals (m, m -l- 1), where m 
is an integer. 

Finally we may suppose, if we please, that the number of 
intervals in any or all of the sets is even^ since we can replace each 
interval by two by bisecting it and effecting another change of 
variable. 

10.15. Completion of the proof of Theorem 379. It is 
convenient to replace /(«) by /(-»), as plainly we may without 
affecting the result. If we do this, write s, t for y and then make 
the substitution « = a: — we obtain 

( 10 . 15 . 1 ) 

f{t-x)g(t)h(-x)dxdt-- | h{-x)x{x)dx, 

, J -to 

( 10 . 16 . 2 ) 

J — aoj —00 

where 





WeBuppo^forth. moment 

. See for example de la ValWe 
I ^(*) to not to be oonfoeed with x* (*)• 
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fimotionB of eete (fonotioBB assnnaing the values 0 a>nd 1 only), 
without using the further simplifications shown to be permissible 
at the end of § 10.14. We denote the sets in which/, ..., /*, ... 
assume the value 1 by F , ..., F*, ...; each function vanishes out- 
side the corresponding set, and F*, ... are intervals syn^ietric 
about the origin. We suppose that 

mF’=>inF*Bs2It, mO=mQ*=‘2S, mH=mH* = 2^. 
With this notation*, we have 
(10.16.4) 

r r f{t-x)g(t)dxdt^ r /(-«)<?« r 
J — 00 J —00 J —00 J — flo 

(10.16.6) /*= *x{x)dx. 

J -T 

If a; is fixed, and t—x describes the set F, then t describes a set 
F^ obtained by translating F through a distance x. If we define 
F*g^ = (-f**)* similarly, then the functions (10.16.3) maybe written 
in the form 

(10.16.6) x(o:)=rn{F^O), *xi^) = m(F%G*). 

From this formula we can calculate *x {x). Let us suppose, as we 
may, that 

(10.16.7) B^S. 

Then *x (*) “ continuous, 

(10.16.8) •x(®) = 0 (1*1^12-1-5), *x{x) = 2B (1*1^5-12), 
and *x(®) is linear in the intervals ( — B—S, —S + B) and 
{8—B,B+8). The graph of •x (®) “ shown in Fig. 3. 

Suppose now that 

(10.16.9) J2+5g2’. 

Then it follows from (10.16.8) that 

/•T r^+8 

I*= I *x(x)dx= I 

J -T J -B-B 

and the result of the theorem follows from (10.16.4). We have 
thus proved the theorem under the restriction (10.16.9). It is 
idsoplainly trueif ^=0 or 5>a0 (when F or <? is null). 

Ghosen to wnphasize the ptroUelism of tho argument with that of $§10.^7. 
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So far F, (?, H have been arbitrary sets of finite measure. We 
now make the further specialisation explained at the end of 
§ 10. 14, supposing that F^O,H are sets of intervals (m, m + 1), the 
numbers ofintervals being 2 J2, 28 , 2 7 respectively. We may if we 
please suppose these numbers even, but we shall argue inductively, 
and it is more convenient to adopt a slightly more general hypo- 
thesis and to suppose only that 2R^28+2T even. In these 
circumstances R, 8, and T are not necessarily integral, but 2R, 
28, 2T and 

(10.16.10) = 


ZR 



are integral. We have already proved that the theorem is true if 
/X ^ 0, and it is also true if iZ = 0 or 5 = 0. It is therefore enough to 
establish its truth when 

(10.16.11) fi = n>0, R>0, 8>0, 
on the assumption that it is true when jLt = ri — 1. 

We denote by F^ the set derived from F by omitting the last 
interval of F on the right; similarly Oi is 0 less the last interval 
on the right of G, Gtenerally, sets, functions, or numbers with 
suffix 1 are derived from Fi and Gi as the corresponding sets, 
functions, or numbers without suffixes are derived from F and (?; 
thus/i* is the rearrangement of fi, the characteristic function of 
-Pi, and ♦xi («) is the ‘Faltung’ of A* and Fi* is the interval 
(•— -K— i)> 

and generally, iZ and S are replaced by iZ - J and - i when we 
pass from F,QtoFi,Oi.By the inductive hypothesis 

(10.16.12) 
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The function "'xi (^) vanishes for | x | ^ iZ + /Sf — 1, is equal to 
— 1 for I X I ^ and is linear in the intervals remaining; 

and T^iZ+S- 1, by (10.16.10) and (10.16.11). Hence* 

for —T^x^T, and, by (10.16.6), 

(10.16.13) I* - /i* = - *Xi = 2T. 

We have now to consider 

(10.16.14) /-/i = J”^A(-x){x(x)-xi(a;)}dx. 

Here, after (10.16.6), 

(10. 16.16) X (*) -Xx{^) = rn{F^O)-m (F^ O^). 

This function is plainly linear in any interval {m, m+ 1), and 
therefore assumes its extrema for integral values of x. Suppose 
then that x is integral. In this case the set Fj,0 is composed of 
full intervals (m, »»+ 1), and, when we remove the intervals of 
Fg. and O furthest to the right, either one or no interval of F^O 
is lost, one if the extreme interval of either set coincides with an 
interval of the other set, and none if there is no such coincidence^. 
Hence x (®) “ Xi (®) is 1 or 0 for integral x, and therefore 

(10.16.16) 0gx(x)-xi(a:)gl 
for all X. 

From (10.16.14) and (10.16.16) it follows that 

(10.16.17) 0^/ — /j= I — f dx^2T, 

J H J a 

and&om (10.16.13) and (10.16.17) that 

(10.16.18) 

Finally (10.16.12) and (10.16.18) give I ^ I*; and this completes 
the proof®. 

» See Fig. 3. 

^ We cannot lose two intervals because the intervals removed from and G 
are the furthest to the right of their respective sets. This is the essential point of 
the proof. 

^ The proof follows the line indicated by Zygmund (1). It is, however, con- 
siderably longer, and necessarily so, since Zygmund's proof is not, as it stands, 
oonolusive. 

We proved that (10.15.16) is true when the intervals removed from F and Q are 
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10.16. An alternative proof. The proof of Theorem 379 given 
by Biesz is also very interesting. We can simplify it by reducing 
the theorem, as in §10.14, to the case in which/, gf, h are each equal 
to 1 in a finite set of intervals and to 0 elsewhere. We represent 
the variables x, y, z on the sides of an equilateral triangle, taking 
the middle point of each side as the origin and the positive direc- 
tions on the sides cyclically. Then a;-fyH-2==0is the condition 
that the points a;, y, z on the sides should be the three orthogonal 
projections of a point of the plane*^. 

The functions /(x), giy), h(z) are the characteristic functions 
of three sets Ei, Eq, each consisting of a finite number of 
non-overlapping intervals, and /*(^), g*{y), h*(z) are the char- 
acteristic functions of the three intervals E^*, E^^y E^* of lengths 
E^^E^yE^^ symmetrically disposed about the three origins. IfE^^ 
is the set of those points of the plane whose three projections 
belong to E^, E^y and £3, and E\z^ is defined similarly, then^ 

J = sin j 7 rJS/i 23 > 

and what we have to prove is that 


(10.16.1) 




The figure E*i 2 s “ defined by drawing six lines perpendicular 
to the sides, and is a hexagon unless one of Ei,Ei, E, is greater 
than the sum of the other two, in which case it reduces to a 
parallelogram. We begin by proving (10.16.1) in the latter case. 
Suppose for example that E,^E,+E,. Then E*,^ reduces to 


It would not 

arbitran, intemJs. Suppose, for example, that ^and o w tn 

k m ^ ““ 

unit interval from each set at each step , but 

x(0)-X4(0)=8. 

assertion ( 1 , 176) ‘those [the which he does not state ex- 

untrue unless his oonstruotion is rcstnc point, since it is the kernel of 

plioitly. It is essential to go closely mto detail at this point, sin 

the proof. ^ of the triangle, then 

» K P is the point in question and 0 is the centre J 

V+y+* = PO {008 . + 00S (« + !»)+<»• <“ + »"»-'’• 

» Vf 9 rmeB, both for the set Ei meaBUte. 

- “ when used as a measure. 18 of course a piau 


0 S. 


'it»> 
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the set of points projecting into Ei* and on two of the 
sides, while Ej^ is included in the set Ei^ defined similairly. 
Hence 

^ == ooseo l^.E^E^^ oosec J w . Ej* E^* = j =» E\^ . 

This proves the theorem when E*^^ is a parallelogram. \ 
Passing to the case of the hexagon, suppose for example that 

E^> E'l'^E^, E^<E-y^E^. 

We define sets E^{t), 

and corresponding intervals Ej* (i), E^ {t), E^* (<), by subtracting 
from each E^ a set of measure t at each end If ( increases from 0 to 

Ei{t), E^{t), E^(t) decrease from E^^E^jE^to sets ■^s(^o)> 

E^{t^ whose measures satisfy 

^i(<o) + ^2(<o) = ^8(«o)- 

The hexagon then reduces to a parallelogram, so that 

(10.16.2) -^ 123 (^ 0 ) ^ •®'*128(^o)* 

If we can prove also that 

(10.16.3) -^128“ ^ 123 (^ 0 ) s -^*128 ~ -^*188 (^o)> 
our conclusion will follow by addition. 

We prove (10.16.3) by comparing the derivatives of 

— Ei2a (t), <f>* {t) = — IS? *123 (t). 

In the first place, the difference between E*^^ {t) and E*ifg {t + h) 
is a hexagonal ring whose area is hP (f) + 0 (h^), where P (t) is the 
perimeter of the hexagon corresponding to the value t, and so 

^ = P (< ) = cosec J TT {Pi (0 + Pa (t ) + Pj (<)}• 

On the other hand the three sets 

^ That is to say +i^i(0 +i^r(0, 

where Si(t) lies to the left» and to the right, of Ei(t), and 

mEi (0 as mEi' (t) as U 
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fa all <rf dxinle™as,6Mh»f length ». for 

^1 j “* mtorraU define a hemgonal ring* w5i 

fa f fmW-®„,((+l). The derirXe f 

fa ^ totjd length of those parts of tie enter bonndary of «de 
rmgwhioh also belong to £,„((). Projeotdng these parts of the 



4. The deorement of Sm{t), 


boundary on to the sides of the triangle, as indicated in the figure, 
we see liiat 


ooaeoin{EAt)+E,(t)+EM-^- 

From this (10.16.3) follows by integration, and this completes 
the proof of the theorem. 

^ See Fig. 4. In the figure the sets Ei •*. are blackened on the sides of the 

triangle, the set (< + A) is shaded, the twelve perpendioulara are dotted, and the 
boundary of E^(t) - E^ (< + A) is indicated by a thick line. 
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10.17. Applications. The special case of Theorem 379 corre- 
sponding to Theorem 371 is 

380. If h (x) M aymmetncaUy decretmng, then 

poo poo 

f(x)g{y)h{x-y)dxdy 

J —00 J —00 

^1* f f*{x)g*(y)h{x-y)dxdyXl*. 

J —CO J —CO ^ 

We shall now apply Theorems 371 and 380 to the speoial\oases 

and h{x—y)=\x—y\-^. 

381. If 

(10.17.1) v>\, g>l, i-f-->l, A=2-^--- 

P g Pi 

(so that 0 < A < 1), and 
then 

|r— s|^ 

where the dash implies that r^s, and K = K(p,q) depends on p and 
q only. 

382. ///(*) ^0, g {y)^0, p and q satisfy (10.17.1), and 

I* fP(x)dx^F, I* g^{y)dy^O, 

J —00 J —00 

then /- r r •(^^^^dxdy^KFVpGPJ«. 

The proofs of the two theorems are practically the same. We 
give that of Theorem 382 

It is plain, after Theorem 380, that we may replace/ and g by 
/* and g*. We then divide 7 into four parts corresponding to 
the four quadrants of integration. The north-east and south-west 
parts are equal, and so are the north-west and south-east parts, 
and the two latter do not exceed the two former**. We need 

* For that of Theorem 381 see Hardy, Littlewood, and P61ya (1); for a deduction 
of Theorem 382 from Theorem 381 see Hardy and Littlewo^ (6). 

^ The north-west and south-east parts could be accounted for by the easier 
argument of § 0.14. 
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therefore only consider the north-east part. Hence, 

OUT notation again, it is sufiacient to prove that 

wlior© now f Bnd g aj© positiv© and docroasing, and F and 0 ar© 
defined by integrals over (0, oo). W© write 


e7== t^2> 

where and Jj ar© integrals over the octants y^x and x^y 
respectively. 

We have J“/(r)dr 

Since g{y) decreases, and (x—y)-^ increases, in (0, x), 

say, by Theorem 236. Hence 


1 f® 

1 ^ iZTX J S'! 

By Theorem 189 

1 / r® X 

(10.17.2) pr/(a;)a;-J’'^(fo;j 


Butp'>g', by (10.17,1), and 

gi (®) = S' iy) dy g a;"®' , 

again by Theorem 189. Hence 
(10.17.3) 

g^' (*) ar-f ^ g^^ (x) x-^'^ = 


(since 




From (10.17.2) and (10.17.3) it follows that 

J-i^K 

by Theorem 327. 

The discussion of is similar, and the theorem follows. 


B1 


19 
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883. Swppoae that f{x) w non-^^egative and IP, where p>\,in 
(0, oo), thai 

(10.17.4) 0<«<^, 

p I — cup 

and that 

(10.17.6) /^(ar)= jV(y) ix-y)«-^dy. 

Thenfg^(x) is Ip in (0, oo) and 

rco / /•«> \Qip 

(10.17.6) J ^ f^odx g i: y ^ fPdxj , 

where K=K{p,oL) = K{p,q). 

Suppose that (7 (^) is any function of L®', and that 


^ , 1 1 « 1 1 

A=l-a=l--+- = 2 >. 

p q p q 

By Theorem 382, 

and a fortiori 

J%(x) J'^iy 

a ® \tip / r® \ W 

Since this is true for all g, it follows, by Theorem 191, that 

a ® \ Va / r® \Vp 

.H ■ 


which is (10.17.6). 

The proof shows that the result is also true when/g, (x) isdefined 

Theorem 383 embodies a result in the theory of ‘fractional integration’. 
Liouville (1) andBiemann(l, 331-344) defined the integral /« (a;) off(x), 
of order a> as 

( 10. 17.7) /.(*) = I */(y) (»-»)•-» dy. 

The lower limit a is the ‘origin of integration*; a change of origin changes 
/a in a maimer which is not trivial formally, though unimportant for 
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theorems of the type considered here. It is easily deduced from Theorem 
383* that, iffiaJj^in (a, 6), where ~oo<a<6^ oo, a< 1/p, and fa is the 
integral off, of order a and with origin, a, then fa is in (a, 6). When a > 1/p, 
fa is continuous, and indeed belongs to the *Lipschitz class’ of order 
a~l/p. 

In applications of the theory, f is usually periodic. It was observed by 
Weyl (3) that the reference to an origin a is in this case inappropriate: 
Weyl accordingly modified the definition as follows. If we suppose that 
the mean value of/ over a period is zero (a condition which we can cdways 
satisfy by subtracting an appropriate constant from/), then 

j _ f(y){x-y)<‘-^dy 

converges at the lower limit, and we may take o= — oo in (10.17.7). Our 
theorem conoeming the Lehesgue classes may he extended to this case also. 

10.18. Another theorem concerning the rearrangement 
of a function in decreasing order. The theorem with which 
we end is important primarily for its function-theoretic applica- 
tions, but the proof which we give*> is interesting independently. 
The theorem may be stated in two forms. 

384. Suppose that f(x) is non-negative and integrable in a finite 
interval (0, a), thatf{x) is the rearrangement of f(x) in decreasing 
order, that 

(10.18.1) 0 (a:) = 0 («,/)= Max I f{t)dt, 

and that@{x) is the rearrangement of @(x) in decreasing order. 
Then 

(10.18.2) 

for 0<a;ga. 

886. Suppose that f(x) satisfies the conditions of Theorem 384, 
and tAof s (y) is any inoreasing function of y defined for y^O. Then 

(10.18.3) J%{0 {x))dx g J/ 

We begin with two preliminary remarks. 

» See Hardy and Idttlewood (6). 
b Due to F. Rie«* (1®)* 


19-2 
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(1) We prove Theorem 384 first and deduce Theorem385. 

Since © (a?) and 0 (*) are equimeasurable, 

J «{0(a;)}da;=J^A{0(a;)}da:. 

Hence (10.18.3) follows fix)m (10.18.2). 

That (10.18.2) follows from (10.18.3), so that the two forms 
of the theorem are equivalent, is a little leira obvious, but is 
proved in Theorem 392.* The first implication is sufficient for our 
purpose here, since it is in the second form that the theorem is 
used in the applications. 

(2) If = = 

then @0 (a;, /) g ©0 (05, /) = 0 {xj ) = ^ I* f{t)dt, 

xj 0 

by (10.12.2), and 

(10.18.4) J“s{©o(a5)}da:S j^J*/(0d<| dx. 

This, a much more trivial inequality than (10.18.3), is the 
analogue for integrals of Theorem 333. 

10.19. Proof of Theorem 384. We suppose, as we may, that 

a= 1. 

We consider a point Xf, for which 

a5o>0, ©(a:o)>0, 

write 

(10.19.1) 0(a:o)=p + € (p>0, e>0), 
and consider the set E defined by 

(10.19.2) OgarSl, 0(a5)>p. 

Since 0 (x) and @ (a:) are equimeasurable, E has the same measure 
as the set in which 0 (a:) > p. This set is at least as large as the set 

• See the Miscellaneotis Theorems at the end of tho chapter. 

Theorem 385 was proved by Hardy and Littlewood (8), who deduced it by a 
limiting process from the analogous theorem for finite sums (Theorem 394)* Their 
proof of Theorem 394 was elementaiy but long, and a much shorter proof was found 
by Gabriel (2). Biesz *en combinant ce qui me paralt dtre Tid^e essentielle de 
M. Gabriel aveo un th^orime appartenant aux 41teents de Tanalyae’ (Lemma A 
below) was able to prove the theorem directly and without limiting processes. 
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in which ©(*)^ 2 >+ 6 , and the measure of this last set is, after 
(10.19.1), at least a:,. Hence 

(10.19.3) x^^mE. 

Now the set E is composed of those points x for which 

(10.19.4) ^^jy^t)dt>p 

for some | | (x) < x. We can write (10.19.4) in the form 

(10.19.5) j J‘{t)dt-px> j f{t)dt-p^ 
or 

(10.19.6) g(^)>g{^). 

say. Thus E is the set of points in which a certain continuous 
function g(x) assumes a value greater than some at any rate of 
the values which it has assumed before. This property enables us 
to characterise the structure of E. 

Lemma A. The set E is composed of a finite or enumerable system 
of non-overlapping intervals (a^, All of these intervals are open, 

?(«ft)=s'(^fc); 

except possibly when x^l is a point of E, in which case (here is 
one interval (a^ , 1 ) closed on the right, and g (a^) S 9^ ( 1 )> (hough g (a*) 
is not necessarily eqvcl to y(l).® 

In the first place, since g(x) is continuous, E is an open set 
(except possibly for the point x = 1). Hence is a set of intervals 

(afc,i5jb),openif^*<l. 

If < 1 then is not a point of E, and 

(10.19.7) gM^9(Pk) 
by the definition of E. 

Next, suppose that and consider the mimmum of 

g (X) in the interval 0 g ® ^ *1 • This minimum cannot he attmned 
for xu<x^x.y since all such x belong to E, and so g {x)>g{i) tor 
some i<x. Hence it is attained for ® ^ a* . But is not a point 
of E. and therefore g (a*) ^ g («) for aU these *. Hence the mim- 

• All that we need is that y («»)S?(A); but the argument wiU probably be 
clearer if we make the lemma complete. 
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mum is attained at aj^, and Making we 

obtain 

(10.19.8) gMigm-. 

and this, with ( 10. 1 9.7), proves the lemma*^. . 

We can now prove Theorem 384. We may write (10.19.8) in 
the form ' 

J etk 

and from this it follows that 

ajfc)<S j f{x)dx—\ f{x)dx. 

J (tk J St 

Hence, by (10.12.1), 

(10.19.9) p.mE^ f f{x)dx^ f f{x)dx; 

Jb Jo 

and hence, by (10.19.1), 

I ^mE 

(10.19.10) 0(a;o)-e=p^^J^ f{x)dx. 

Finally, sinoe f(x) decreases, it follows from (10.19.10) and 
(10.19.8) that 

0(a:o)-€gi f ’‘f{x)dx. 

*oJ 0 

Since e is arbitrary, this gives (10.18.1), with x^ for x. 

The fimotion-theoretic applications of Theorems 384 and 385 arise 
as follows. Suppose that/ (9) is integrable and has the period 2jr, that 

M(0)-M{e.f)= Max \rf(e+u)du, 

and that N(B) is the similar fimction formed with |/(^+t4}|. These 
functions are of the same type as the %(x) of Theorem 384» but are 
generated by means taken to either side of d. 

Consider now the integral 

(i) h l[j(e+t)x(t. p)dt. 

where x i^ ^ kernel which involves a parameter p and satisfies the 
conditions 

(ii) x(<»P)^0, 

The stemdard examples of such kernels are the 'Poisson kernel * 

_ 1— r* 

^“"1 — 2rcose+r*' 

* The argnment here is due to M. Biesz (see F. Riess, 10). 
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in which p 5= r is positive and less than 1, and the ‘Fej6r kernel’ 

— 

^""nsin® 

in which p = n is a positive integer. The corresponding values of h are 
ti(r, d)f the harmonic function defined by the ‘Poisson integral’ of f{6), 
and an{d)f the Ceskro mean, of order 1, of the Fourier series of f{6). 
Suppose now (a) that f{6) belongs to where Aj> 1, and (6) that x 
satisfies the additional condition 

(iii) i J' dt^A, 

where A is independent of p. It follows from Theorems 385, with 
s (y) = and 327, that M {$) also belongs to And it is easy to deduce 

f»*om (i), (ii), and (iii) that 

\hie,p)\^AM(e), 

where A is again independent of p. Hence h has a majorant (independent 
of p) of the class 

It is easily verified that the Poisson kernel satisfies (iii). Hence 
u{r, 6) possesses a majorant U(B) of the class L*'. The same is true of 
Gn(0), but in this case the proof is not quite so simple, since the Foj4r 
kernel does not satisfy (iii). We can however prove that l<7n(p)l ^AN(e), 
and similar conclusions follow. All this is set out in detail by Hardy 
and Littlewood (8). 


MISCELLANEOUS THEOREMS AND EXAMPLES 
386. If cj ^ ca ^ ^ ^ 0 and the sots (a), (b) are non-negative and 

given except in arrangement, then 

n II 

L L Of^afbf 
rssii «ssl 

is a maximum when (o) and (6) are both in decreasing order. 

[F. Wiener (1).] 

887. It is not true that 

S arh.ca S a+b+Ct+. 




S 

r+«+<=0 


[Trivial: take (a), (b), (c) to be (0, 2, 1), (1, 2, 0), (1, 2, 1). Then 
So,6,c,= 14, Sa,+&.+c,+=12.1 

888. There are sets (a). (6). (c) such that none of 

Sa+b+ff*", S+a6+c+. So++bc+, .... S a 

givea the maximal sum ^abe. .^cientlv smril: and take (a) to 

[Suppose 0<fc<l^d€posi IV and(c) to be formed of any five 

be 0, e, 0, 1, 2, (6) to be k-., h, A+e. 1. 1. W 
different dements.] ^ ^ Theorem 398 below. 
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8g9. If M(x) = i:\r\xr, M{y)^J:\a\y,, 

and pirO, then the substitution O, of § 10.8 deoreases /Lt=ilf (a?) + JIf (y). 

[The theorem is trivial, but may be used to oonstruct another proof of 
Theorem 371, which follows the general lines of that in § 10.8 bui is free 
from an appeal to * continuity *. I 

We use A, C, C\ as in §10.8; there may now be more thm one 
arrangement O, We define L as the sub-olaas of K formed by those mem- 
bers of K for which p is least. If p 4= 0, and A is wrong with respect^ to p, 
then Qp decreases n and does not diminish S. Hence any A of JD is^ O'; 
and we can then show as in § 10.8 that L includes a O.] 

890. In the notation of Theorem 373 

S arh.Ct^ S a++6,c** 

r+Si+t-n 

for every n. 

[Corollary of Theorem 373.] 

391. If (o), (a'), (b), (b'), (c), and (c') are six sets of positive numbers 
subject to (10,6.1), then 

£ arar'b,b/ctc/^ S 
r+«+<=0 r-r«'ff=0 

[Corollary of Theorem 372 if fiirst reduced, by the method of § 10.3, 
to the special case in which every number is 0 or 1.] 

392. If / and g are non-negative, and 

(i) {/(*)} rfaiS jy{g(x)}dx 

for every positive and increasing 8 (y), then 

(ii) /S# 

except perhaps for an enumerable set of values of x. 

[This is the theorem referred to in § 10.18, as proving the equivalence of 
Theorems 384 and 385. It is an analogue of Theorem 107. 

Since the integrals (i) are unaltered when we replace/ and g by / and g, 
we may suppose / and g themselves decreasing, so that/ss/, g^g (except 
perhaps in an enumerable set of points). 

If (ii) is not true for almost all x, we can find a b and a o such that 

(iii) 6<c, f(c)>g(b). 

For, if this were not so, we should have f{b-{^0)^g[b) for all h, and 
f(h)^g (b) at iJl points of continuity of the functions, and therefore except 
in an enumerable set. 

Supposing then that b and c satisfy (iii), we choose r so that 

g{b)<r<f{c) 

and define s(y) by 

= 0 (y<r), a(y) = l (ygr). 

Then f 8{f(x)}dx=f dx^ob^f dirss /%{g(a?)}da?, 

.'f I/S£r Jg;tr Jo 

in contradiction with (i).] 
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893. If Oj. o*. .... 0^7 are non-negative, 

©(n) = 0(n,o)= Max ••• +gn 

!<!>*-■ 11 W— * 

and a bar denotes a rearrangement in decreasing order ( a notation opposite 
to that of § 10.1), then 


(n) ^ 


«i + d2+... + a„ 




394. If the conditions of Theorem 393 are satisfied, and 8 {y) is a positive 
increasing function of then 




±^+...+a„\ 


[The last two theorems are the analogues for finite sums of Theorems 
384 and 385, and the reader will find it instructive to prove them by an 
adaptation of the argument of §§ 10.18—19. The earlier proofs of Hardy 
and Liitlewood and of Gabriel are referred to in § 10.18.] 

395. If Ci^C2^...^c,>0, 

«a» •••! is i'he aggregate of the c and d rearranged in decreasing 
C„ = Ci-l-c,H-...-(-c„, 

andZ>„ and are defined similarly; and « (t/) is positive and increasing; then 

»(Oi) +« + ...+S +8(D^) + a(^^) + ... H-a 


^8 ( El ) -{-8 


+ ...+« 


[This is a special case of Theorem 394. For a direct proof by induction, 
due to Chaundy, see Hardy and Littlewood (8): the theorem is one of 
the lemmas on which they based their proof of Theorem 394.] 

396. If p, q, P, Q are positive integers and « (y ) is positive and increasing. 


1 \^/ afi p+g+i \ 


\nj p+g+l \ ^ / 


[(i) and (ii) follow from Theorem 395 by appropriate specialisation, and 
(iii), which is true whether p and g are integers or not, is a corollary. A 
case of (iii) is 

(O>0,6>0, 0<a;<l): 

this may naturally be proved independently (and with < ), for examp 
aa an application of Theorem 103.] 

807. If a, b, a, are positive and « positive and moreasing, th«i 
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898. If A; > 1, and © (x) is defined as in Theorem 384, then 

jy{x)dx^{^^y jy^{x)dx. 

[From Theorems 385 and 327. There is of course a corresponding theo- 
rem for finite sums. This theorem has particularly important applicajfcions.] 

399. In order that an integrable fimction ^ {x) should have the property 

j 8{x)^{x)dx'^0, \ 

for all positive, increasing, and bounded 8(x), it is necessary and sufficient 


that 


y 4,{t)df^0 (Ogaigl). 


\ 


To prove the condition necessary, specialise 8(x) appropriately; to 
prove it sufficient, integrate partially or use the second mean value 
theorem. The condition is certainly satisfied if there is a ^ between 0 
and 1 such that ^(a;) ^ 0 for x>^y for a? < ^, and 


f. 


^(a;)da?=0. 


Theorem 397 is a special case of this theorem (after a simple trans- 
formation).] 

400. If E and f are functions of x subject to 


then 


«>i,. 


[Suppose that f{x) is always 0 or 1, and that E {x) is the measure of the 
part of (0, x) in which/(a;) = 1, and apply Theorem 385.] 

‘p g' 


401. If 


P>1, 


q>l. ^ + A=2- 

P Q 


A<1- 


h + k^O, 


and^ + A!>0if- + - =1, then 
P g 


/:/. 




i/« 


[Here, and in Theorems 402 and 403, K denotes a positive number 
depending only on the parameters of the theorem (here p, g, h. A:).] 


402. If 
then 


p>l, 0ga<i, 


P CCp 
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where/«isdefin©da8m(l0.17.6). The result is stiH troe if a^l/p, when 
the second condition on q may be omitted. 

[For the last two theorems see Hardy and Littlewood (6). The case 
gives roo 

(x-^foi)^dx^K 1^ fpdx: 

compare Theorem 329.] 

403. The result of Theorem 383 is not necessarily true when p = 1 , 
[Define /(a?) by 

m=l(ioeiy'‘(0<x$i), =0 (*>i). 

where jS>l. Then 

Mx]=K “ (x-y)<‘-^dy 


Here 

f is L, but/flf is only if 


p=h q= 
0-1 




i\i-» 


i_ 

l-et' 


1 — a 


>1, iJ>2-a.] 


404. Suppose that /(») is defined in (-1, 1) and has a continuous 
derivative/'(x) which vanishes only at a finite number of points, and that 
/(x)^0, /(-1)=/(1)=0. 

Then the length of the curve y=/(») is greater than that of y=/*(®), 
unless /(x) =/♦(*). 

[See Steiner (1, n, 266). K 0<y< FaMax/ then (except perhaps for 
a finite number of values of y) the equation y=/(x) has an even number 
2n (depending on y) of roots. If we denote these roots, in ascending otdei, 
by Xi , X, , . . . , X|Hi • *“<1 derivative of x, with respect to y by x/, then, 
by Theorem 25, 

2 |^{l+[ 4 S{ - l)’'x/]*}idyg S(l+»/*)*%- 

There is equality only if n = 1 for all y, and Xi= -x,.] 

406. Suppose that/(x, y)g0 for all x. y, and that then^M(s)of 
the set in wWch/(x, y)S* is finite for all positive *; define p{z) by 

and write 'Appropriate con^^^^^ 

where p-^ is the mverse of p. than that of 

regularity) the area of the surface *-/(*• y) P* 

z=f*{x,y). 

because it mvolves a two-r 
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ON STRICTLY POSITIVE FORMS 


11.1. Our aim in Appendix I is to present an elementary 
proof due to W. Habicht^ for an important particular c^se of 
the theorem of Hilbert and Artin stated in § 2.23. 

The term ‘form* will be used as an abbreviation for ‘a hbmo- 
geneous polynomial with real coefl&cients*. A form 


h — F X2t . . 

is said to be strictly positive if F > 0 for real values . . . , 

of the variables unless Xi—X 2 =..-=x^= 0 ; we assume further 
that F is not a constant. We wish to prove the following theorem. 


406. Any strictly positive form F can be expressed as 


( 11 . 1 . 1 ) 




where and are suitably chosen forms. 

The proof is based on Theorem 56. In that theorem the term 
‘a form with positive coefficients’ is used implicitly with the 
following meaning (the reader should verify that with this 
meaning the proof in § 2.24 does apply) : if = G(xit a^g, . . . , a;^) is 
a form with positive coefficients and its degree is jr, then 


where 


G = Saa?!®! iCg^* . . . 




for all integers a^, ag, . . . , satisfying the conditions 


aiSO, ag^O, ..., ai-fag+ ... + a^ = ^. 

We arrive at 406 in three steps from 56. 

» *Cber die Zerlegung strikte definitor Formen in Quadrate*, Commentarii Math. 
Hdveiici, 12 (1940), 317-322. 
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11-2. A particular case. Let the form K bo 

even in each of its variables. That is, 

X 3 , ...,xJ = K (Xi, -Xs„ ...,xj = ... 

K (ajj, . , . , x^_j^, Xin ) = A (a^j, x^t . . . , x^). 

Since only even powers of the variables occm in such a form K 
we have 


(11.2.1) A (a:^, x^, ,..,x^—1j(x-^, x^, ..., a:^*), 

where L{yx,yi, ..., 2 /,„) is a form. If the degrees of A and L are 
h and I respectively we have of course h — 21. 

Assume now that K {x^,x^, ...,x„) is strictly positive. Then 

L(yx,yi,...,yJ>(i 

for y ^ 0, Sy > 0, and so, by Theorem 56, 


( 11 . 2 . 2 ) 


A = 


G 

(yi + 2/2+"-+2/m)*’’ 


where G is a form with positive coefficients. Any positive number 
is the square of some positive number, and we can write G as 


(1 1.2.3) G = 


where the q are positive numbers. It follows from (11.2.1), 
(11.2.2), and (11.2.3) that 

, “• a;,"* . . . *„,““)* 

(11.2.4) K{Xx,Xi, = ^ .TT+'^z^p* 


K is thus expressible in the desired form ( 1 1 . 1 . 1 ). In fact, the forms 
M^ and reduce to monomials.* According to the remarks at 
the end of § 11.1 none of the monomials qx^’^^x^^* ... a:„“" com- 
patible with the degree of K vanishes. 

11.8. An intermediate representation. We consider now 
any strictly positive form F (Xi,...,x^) and the product of 2 
factors 

(11.3.1) 11 II ... n i^’((-i)‘'‘*i>(-i)''‘® 

»,-0lh-0 Vm-0 =K(Xi,Xx,..;Xj. 


» Polya (3). 



302 


▲FFSSNDIX I 


The form jK" (* 1 , is obviously even with respect to each 

of its variables, and strictly positive as a product of strictly 
positive factors. (1 1.2.4) accordingly applies to K, and combining 
this with (11.3.1) we arrive at the conclusion: given any ^ricUy 
positive form F we can find a form P such that the product FP is 
a sum of squares of monomials: 

(11.3.2) FP=I.(qZy°^xf^...xJ^)\ 

There is as yet no reason to suppose that P is a sum of squcure^ and 
our goal (11.1.1) is not yet attained. 

11.4. Proof of Theorem 406. Consider again the strictly 
positive form F. Let 2n be its degree. We introduce (this is the 
key-idea) an extra variable u and consider the form 

u^ + F(xi,x^,...,x„) 

of m + 1 variables; this is obviously strictly positive. After §11.3 
we can find a form P* {x^, ...,x„,u) such that 

(11.4.1) {u^^+F{Xi, ...,xJ)P* («! x^,u) 

= S (qxj^i . . . x„“'»w“)®. 

We attain our goal by 'reducing mod («*" + F) ’ the right-hand 
side of (11.4.1), and proceed to explain this process in detail. 

We consider the general term of the sum of the right-hand side 
of (11.4.1). Dividing a by 2n we have two non-negative integers 
fi and y such that 

(11.4.2) a=2»/3-hy 
and 

(11.4.3) 0;gy<2»-l. 

Then 

(1 1 .4.4) «“ = = [(tt*»)^ - ( - P)^ uy+(-F)P uy. 

The expression in square brackets is divisible by «*”— (— P). 
Hence, from (11.4.4), 

(11.4.6) =(tt*"-f-P)0-)-JB«’', 

where ...,®„j)=grxi“»...x„“»(-P)^ 
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depends on m variables only and does not vanish identically. 
(The form Q = Q (x^, . . . , u) may depend on m + 1 variables and 
may vanish identically; Q and Huy are the quotient and the re- 
mainder, respectively, in the division of the monomial on the 
left-hand side of (11.4.5) by considered as a polynomial 

in u alone with coefiBcients depending on Introducing 

(11.4.5) on the right-hand side of (11.4.1) and transporting the 
terms divisible by JF to the left-hand side, we obtain 

(11.4.6) 

y=0 d 

Observe that y satisfies (1 1.4.3), and also that, in accordance with 
the remarks at the end of § 11.1 and § 11.2, the sum on the right- 
hand side of (11.4.1) has some terms with a = 0 and, therefore, in 
view of (11.4.2), with y = 0. The sum 

^Rh 

$ 

is therefore not empty and docs not vanish identically. 

From (11.4.6) it follows that P** as a polynomial in u is of 
degree < 2n~ 2. We can therefore write (11.4.G) as 

2ti-1 

{u^- + F) i: T^{x^,...,xjuy= S 

y = 0 ^ 

CoMparing the coefficients of vP and u~^ we obtain 


d 

From the first equation we conclude that Tq does not vanish 
identically; the two equations then give 

^ 9 


which proves the desired (11.1.1) and Theorem 406. 
The proof actually yields more than we have stated, 


We have 


ih fact 
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407. If (he coefficients of the strictly positive form F are regional 
numbers, the representation (11.1.1) can be chosen so that (he 
coefficients of the forms and Nf are also rational numbers. 

We need only one modification. In deriving (11.2.3) ^ used 
the obvious remark : any positive number is the square of a ^rtain 
positive number. We have now to use instead the equally obvious 
remark: any positive rational number is a sum of squares of 
positive rational numbers. (In fact, if r and s are positive integers, 


r _r8 
s~s^ 



a sum of rs terms.) 
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THORIN’S PROOP AND EXTENSION 
OF THEOREM 295* 


12.1. This is both brilliant and easy to follow: the reader who 
wishes to avoid all difficulties may substitute the present appen- 
dix for § 8. 1 5. The proof, however, depends essentially on complex 
function-theory, which we have systematically excluded from 
our main account, and the argument of §8.15, in spite of its 
greater difficulty and more limited range, retains its interest and 
importance. The extended theorem is 


408. Suppose that is the maximum of \ A \, where 

m n 

A=A{x,y)='S, Z aijXiy^ 

i«l i«l 


for 


m ” o 

( 12 . 1 . 1 ) 


it being understood that ifa^dor p=0 (or both), these ineqmlities 
are to be replaced by la:,Ul and \y,\^l, respectively. Then 
log JIf, ^ is convex in the quadrant a ^ 0, ^ ^ 0. 

The proof turns on the following principles, all of them obvious, 
(i) The upper bound^ of a family (possibly infinite m number) 
of convef functions is convex; (ii) the limit of a convergent 
sequence of convex functions is convex; (iii) m ^“*^8 
bound taken with respect to a number of independent condition 
we may take successive upper bounds, and in any order; 

example, jiax/(a;,y)=Max(Max/(x,y)). 

^ ix) iv) 


{x,v) 


still more general forms. .ipogt upper bound’. 

» ‘Upper bound’ throughout means least pp 


HI 
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We need the following 

Lemma. Let S = be a finite sum with real b and s — a-\-it 
with real a and t, and let 

m ( g ) = Max I S I . 

(0 

Then log m (a) is a c&nvex function of a. j 

m (a) is continuous in the a, 6; it is therefore, by principle (ii), 
enough to establish the case in which all b are rational. S is xhen 
a poljmoniial in 2 = where Q is a common denominator of the 
6’s, and the result is then a consequence of Hadamard’s ‘three 
circles theorem’. 

\lf f(z) is regular in an annulus < 1 2 1 ^ rg, and M (r) is the 
maximum of\ f{z) | for | 2 1 = r, then log ilf (r) is a convex function 
oflogr. 

We give the proof for completeness. Choose /a so that 
rf (fi) = (fg) = T. 

Then zff{z) is regular at every point of the annulus and has a 
modulus one-valued in the annulus. Its maximum modulus for 
the boundary is T; it follows that r^^M (r) g T, whence we have 

log M(r)'^t log M (r^) -}- ( 1 — /) log M (rf) 

for logr = dogri-|-(l — ^llogrg.] 

Coming now to Theorem 408, it is enough^ by continuity (as 
was observed in §8.13), to prove logM^ ^ convex in a on any 
segment a 2= AiU, jS = j80 -h Aga lying in the open quadrant a > 0, 
^ > 0. Now for an internal (a, fi) we may write 

Vi = S 0, 

and then, for varying (real) tp, and non-n^ative i, ij varying 
subject to 

(12.1.2) S^^l, 

(a condition independent of a, )8), 

Jf„.^=Max 1 |. 



APPJ5NDIX II 


307 

If in this we replace a by 5 = a -h then for any given real t the 
upper bound is unaltered + is merely ‘translated'). We can 
now add an operation Max for varying t\ and by principle (iii) 
we can make this the inner operation: 

Q = Max (Max 1 SSaf 1 ). 

(t) 

Now for fixed <^, */r, rj we may suppress in SS any term for which 
either the f or the is 0 (the indices are positive), obtaining a 
modified double sum SS, say. Then 

If . = Max {a), m(a ) = Max \ F {s) \ . 

0,irX,n (« 

But F is a S of the Lemma, so that its logm (a) is convex (for all 
a, and in particular on the segment). The upper bound of the 
family of logw (a) is now convex, by principle (i), and our proof 
is completed. 
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ON HILBERT’S INEQUALITY 


13.1. After the remarks in § 4.6 about the limitations of nhe 
theory of maxima and minima of functions of several variable it 
is of interest that some of the more sophisticated inequalities 
are capable of simple proof by this method. We take for illustfi^- 
tion Hilbert’s inequality, but for simplicity with the 6’s identical 
with the o’s:* 

„.„_om+n+l 0 

unless all the a’s are 0. 

We may suppose more than one a different from 0, the excepted 
cases being trivial. Consider 

N „ „ 

F{a) = F{ao,a^,...,ajf)= S 

subject to 
(13.1.1) 


m,n 


.om + n+l 


N 


0{a)==I.aJ = t, 
0 


where t is a positive constant. If any is 0, a small increment 
8 in this produces an increase of 8* in (? and one of order 8 
in F, and so increases the ratio FjO. It follows, since F is con- 
tinuous, that subject to(13.1.1)i^ attains its maximum F*=F*{t) 
for some set of a„, none of which is 0. 

For this set we have 


a A^- = 0 (n^N) 

oa„ da„ 


for some A, independent of n. This gives 


• J. W. S. Cassels, ‘An elementary proof of some inequalities’, J(mm, L,M.8, 
23 (1948), 285-290. There are similar proofe for the general case, and for Hardy’s 
and Carleman’s inequalities* 
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and multiplying by a„ and adding we have 

F*{t)=U. 

Let attain its maximum at m=ii. Then (13.1.2) 

withn=jLi gives 

Sincje {(a:+f4+ 1) is strictly convex, 


^ 1 dx 

JiQ(in+n + l){m+\^"^ J -t («+M+l)(*+i? 


Jo Jo 


and it follows, since + 0, that A < w. 

For any non-null set {%) we have now 

N 

F (o) ^ F* (G) = A(3 < 7r(? = TT S o„* 
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